0.1 List of symbols and terms

General symbols 

$\emptyset$ is the empty set.

$A \subset B$ means that the set $A$ is contained in the set $B$, with $A = B$ allowed.

$\overline{S}$ is the closure of a set $S$ in a topological space.

$\mathbb{C}$ is the set of complex numbers.

$\mathbb{N} = \{1, 2, 3, \ldots\}$.

$\mathbb{R}$ is the set of real numbers.

$f^* = \overline{f}$ is the complex conjugate of a complex-valued function $f$.

$\lambda$ is the Lebesgue measure on $\mathbb{R}^n$.

$\mathfrak{L}(V)$ is the space of bounded linear operators $V \to V$ on a normed space $V$.
Tr denotes the trace of a bounded linear operator on a Hilbert space (see [Mu]). 

Symbols defined in the text 

$A \leq B$ in a *-algebra, 2.7
B^{F), 1.7.2
$B_\infty(\mathbb{R}^{2n})$, 1.3
B^{y^), 1.7, 2.1
$L(V)$, 2.2
tr, 1.6
$\chi_A$, 1.3
X 2.3

$\|\cdot\|_\psi$ where $\psi$ is a state on a unital *-algebra, 2.2
$[\cdot]$, 2.3

Terms 

accurate, precise, 1.4 

bounded quantum system, 1.7.3 

Cauchy-Schwarz inequality for states, 2.2.2, 2.5.6 (in the proofs) 

conditional probability, 1.3, 1.5, 1.6.1 

constant energy surface, 3.2.8 

density operator, 1.2 

ergodic, 2.3.2 

factor, finite factor, 1.8.2 

finite von Neumann algebra, 1.7 

faithful, 1.7, 1.7.4, 2.7.1 

flow, 1.3 

Hamiltonian flow, 1.3 
ideal measurement, 1.5.1 

information, 1.4(b), 1.6.1

measurement, 1.1.2, 1.6 

noncommutative information, 1.6, 1.6.1 

normal state, 1.7.4 

phase space, phase point, 1.3 

pure state, 1.3, 1.4(b), 1.7.4, 1.9.2 

*-dynamical system, 2.3.1 

spectral projection, 1.4.2, 1.2, 1.3 

state, 1.2, 1.4, 2.2 

unital *-algebra, 2.1 

All Hilbert spaces are assumed complex. 

We will use units in which $\hbar = h/2\pi = 1$, where $h$ here denotes Planck's constant.

0.2 Introduction

The main motivation for this thesis is to gain a deeper understanding of the struc- 
ture and nature of quantum mechanics. This will be achieved by a careful analysis 
of the relationship between quantum mechanics and classical mechanics. Quantum 
mechanics is inherently a statistical theory, while classical mechanics is not. The 
essential idea is therefore to study the general structure of statistical mechanics in 
a mathematical framework that applies to both quantum mechanics and classical 
mechanics. The language of abstract C*-algebras is ideally suited for this, since 
it provides a unified formulation of quantum mechanics and classical mechanics, 
with classical mechanics then viewed as a special case of quantum mechanics where 
we have commutativity. The concrete realizations of the C*-algebras in quantum 
mechanics consist of linear operators on Hilbert spaces, which are mathematical ob- 
jects that differ very much from the measurable functions that make up the concrete 
realizations of the C*-algebras in classical mechanics. For this reason the abstract 
approach clarifies the general structure of mechanics (quantum and classical), en- 
abling the above mentioned unified formulation of mechanics. This is discussed in 
detail in Sections 1.1 to 1.5 of Chapter 1. 

From a mathematical point of view the general structure of classical mechanics to 
be presented is nothing more than probability theory (or, a probabilistic description 
of information) with dynamics, while the general structure of quantum mechanics is 
noncommutative probability theory (or, a probabilistic description of
"noncommutative information") with dynamics. From a physical point of view the information
referred to here is the information an observer has regarding the state of the physical 
system in question, while the dynamics describes the time-evolution of the system. 
The mathematics then suggests an interpretation of quantum mechanics in terms of 
the idea of noncommutative information, which clarifies several conceptual problems 
surrounding the measuring process. This interpretation is discussed in Section 1.6. 

As is implied above, our view of statistical mechanics is that it is a description of
situations where we have incomplete information about the state of a physical system
(quantum or classical). In practice this is generally the case, since exact measure- 
ments are impossible, except for some simple quantum systems whose observables 
have discrete values which are separated enough to be distinguished by our measure- 
ments. If an observable has a continuous spectrum of values, then the best we can 
hope for when measuring the observable, is to obtain an interval of values containing
the "actual" value of the observable (if we do not measure an observable of a quantum
system exactly, then it does not really make sense to say that the observable
has an actual precise value, unlike in classical mechanics where it is possible to think 
of an observable as having an exact value, even if we did not measure it exactly). 
For classical mechanics the most important observables (like energy, momentum and 
position) are not discrete but continuous, the major exception being the "number 
of particles" which is important in the statistical mechanics of large systems, but
usually not exactly determinable, simply because in this case there is typically a 
huge number of very small particles involved. For this reason we view the statistical 
nature of physics as fundamental, even for classical mechanics. Mathematically, the 
case where we do have complete information is simply a special case of statistical 
mechanics, and hence is covered by our work. We will therefore usually refer simply
to "mechanics" (quantum or classical), rather than "statistical mechanics". When
we do use the term "statistical mechanics", it will be in the traditional sense, namely
to refer to large systems where there are too many parts (usually small particles)
for each to be measured individually (so we do not know the position, momentum
and so on of each particle). In this case only a small number of parameters referring
to the system as a whole (or to pieces of the system much larger than its individual 
parts) can in practice be measured, for example the temperature, volume, mass and 
pressure of a gas confined to some container. 

Having set up a unified framework for quantum and classical mechanics, we pro- 
ceed to consider recurrence and ergodicity. These concepts originated respectively in 
Poincaré's work on celestial mechanics and in Boltzmann's work on classical statistical
mechanics, and now form part of what is known as ergodic theory. We want to
study recurrence and ergodicity in our unified framework for mechanics to gain some 
insight into the properties of quantum mechanics as opposed to classical mechanics. 

The notion of Poincaré recurrence in classical mechanics is quite well-known.
Roughly it means that within experimental error a classical system confined to a 
finite volume in phase space will eventually return to its initial state. This happens 
because of Liouville's Theorem, which states that Lebesgue measure is invariant under
the Hamiltonian flow in the phase space $\mathbb{R}^{2n}$. Ergodicity in classical mechanics
refers to the situation where for every observable and (almost) every pure state of a 
system, the time mean of the observable (for that pure state) is equal to its average 
value on the constant energy surface containing the pure state, in which case the 
system is called ergodic. Again Liouville's Theorem is an implicit ingredient, since it 
induces a time-invariant measure on the constant energy surface (see Remark 3.2.8 
for a brief discussion). It should be noted that ergodicity is of some importance in 
physics, since it forms the starting point of many developments of statistical me- 
chanics (see for example [Rue, Section 1.1]). To study recurrence and ergodicity in 
quantum mechanics, we can expect from these remarks that we will need a quan- 
tum mechanical analogue of Liouville's Theorem. We propose such an analogue in 
Section 1.7 of Chapter 1, and in the process we are naturally led to consider finite 
von Neumann algebras. 

Recurrence does in fact occur in quantum mechanics. One approach to recurrence
in quantum mechanics has been through the theory of almost periodic functions (see 
for example [BL], [HH] and [Fere]). Another line of research, involving coherent 
states, along with possible applications of quantum recurrence, can be traced in 
[SLB] and references therein. However, these methods differ considerably from the
measure theoretic techniques employed to study recurrence in classical mechanics. 
In Section 3.1 of Chapter 3 we will see how recurrence (in a probabilistic sense) 
in quantum mechanics can be cast in a mathematical form that looks the same as 
the classical case, using our unified formulation of mechanics. More precisely, the 
quantum case is a noncommutative extension of the classical case. 

The mathematical aspects of recurrence and ergodicity are the subject of Chapter
2, where one clearly sees that these concepts are not really measure theoretic in 
nature, as it might seem from numerous books (for example [Pet] and [Wa]), but 
rather *-algebraic, with the basic tools being some Hilbert space techniques. The
idea is to study recurrence and ergodicity in the most general mathematical setting 
possible. This then includes our unified framework for mechanics as a special case. 
In Chapter 3 we look at a few physical aspects of recurrence and ergodicity, including 
some speculation on the relevance of these ideas in quantum mechanics. 

The original inspiration for this thesis came from [NSZ], where recurrence is 
studied in a C*-algebraic framework from a purely mathematical point of view. The 
work presented here is for the most part based on [D2], [D3] and [DS]. 



Chapter 1 



A C*-algebraic formulation of 
mechanics 

In this chapter we formulate quantum mechanics and classical mechanics in the lan- 
guage of C*-algebras. The exposition is based on [D2] and [D3], but contains some 
additional material. As we shall see, the general structures of quantum mechanics
and classical mechanics are identical, except for commutativity, when both are
viewed purely in C*-algebraic terms. We therefore obtain a unified framework for 
mechanics which will be seen to be very natural for studying some ergodic properties 
of quantum and classical mechanics in Chapter 3. 

Sections 1.1 to 1.5 present general aspects of mechanics in a statistical frame- 
work, and in Section 1.6 an interpretation of quantum mechanics inspired by the 
mathematical setting is discussed. The physical concepts are introduced gradually 
in the sense that certain ideas are initially only used intuitively, since their for- 
mal presentation can only be given once the framework has at least been partially 
erected. Sections 1.7 to 1.9 treat specialized topics to be used in Chapter 3; these 
topics do not apply to mechanical systems in general. 

1.1 Yes/no experiments 

We start with two simple definitions that apply to both quantum mechanics and 
classical mechanics: 

1.1.1 Definition. An observable of a physical system is any attribute of the 
system which results in a real number when measured, where this measurement must 
be verifiable, in other words, if the measurement is repeated immediately (so no
disturbance or time-evolution of the system occurs between the measurements) then it
results in the same real number. We call this real number the value of the observable 
during the measurement. 

If a measurement is not verifiable in the sense of Definition 1.1.1, then there is no
well-defined value of whatever it is that we measured, and hence we do not consider 
it to be a measurement of an observable. 

1.1.2 Remark. By a measurement (or observation) we mean that an observer re- 
ceives information regarding the physical system. The verifiability of a measurement 
essentially says that the information obtained in the measurement is correct, since it 
means that if we could repeat the measurement then we would with probability one 
get the same result. This is the type of measurement we will deal with in this thesis. 
We can therefore also refer to a measurement as a preparation. This is an idealiza- 
tion of reality (also see [Om, p. 82] on defining an ideal measurement in terms of 
verifiability). In the worst cases a system might in practice even be destroyed by a 
measurement (for example a particle absorbed by a detector), and then a repetition 
of the measurement would not be possible. Without idealization however, it would 
be impossible to do physics. After an ideal framework has been set up, non-ideal 
situations can be understood in terms of the idealization. See Sections 1.5 and 1.6
for more on ideal measurements. It is very important to mention that we will view 
all (ideal) measurements as yes/no experiments, defined below, which means that 
an ideal measurement does not necessarily supply complete information, but only
correct information (also see Section 1.4). The idea of a single value in Definition 
1.1.1 should therefore be viewed only as preliminary, to help us to build up the 
statistical ideas used later on. ■

1.1.3 Definition. Consider any observable of a physical system, and any Borel set 
$S \subset \mathbb{R}$. We now perform an experiment on the system which results in a "yes" if
the value of the observable lies in S during the experiment, and a "no" otherwise; 
the experiment gives no further information. We call this a yes/no experiment. 

Definition 1.1.3 seems justified, since in practice there are always experimental 
errors during measurements, in other words we always get a range of values (namely 
S in Definition 1.1.3) rather than a single value. 

1.1.4 Remark. In quantum mechanics one should be careful in interpreting Def- 
inition 1.1.3. While in classical mechanics the mathematical framework allows us 
to assume (if we want to) that there is some objective single value of an observable
at the time of a measurement (even though we only get a set of values), this view 
can not be held in quantum mechanics. In quantum mechanics the different values 
in the set correspond to orthogonal state vectors (for simplicity we assume for the 
moment that the observable's spectrum is discrete), but the system need not be in 
any of these states, it can also be in a superposition of them, meaning that none of 
the values in the set is the "actual" objective value of the observable. 
So, if in quantum mechanics a measurement returns a set of values, then we
cannot view any one of these values as being the actual objective value of the 
observable. However, a series of measurements of the same observable (assuming 
there's no time-evolution, measurements of other observables, or outside influences 
on the system) should at least be consistent with each other, in the sense that the 
intersection of the sets obtained in the measurements should be non-empty. ■

Typically a measurement gives an interval which contains the value of the observ- 
able being measured. For example, a measuring instrument with a "digital" read-out 
possessing only four digits might read 1.520, which means that the value lies in the 
interval [1.5195, 1.5205]. We now assume that this is then the only information we 
have concerning the value (for example, we do not have a non-constant probability 
distribution for where in the interval the value lies). The interval [1.5195,1.5205] 
here plays the role of S in Definition 1.1.3. 
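
The read-out example can be made concrete with a small sketch. It assumes that the instrument rounds to its last displayed digit, and the names `Interval`, `readout_interval` and `yes_no_experiment` are introduced here purely for illustration. Exact rational arithmetic is used so that the endpoints of the interval are not affected by floating point rounding.

```python
from dataclasses import dataclass
from fractions import Fraction


@dataclass(frozen=True)
class Interval:
    """Closed interval [lo, hi], playing the role of the set S in Definition 1.1.3."""
    lo: Fraction
    hi: Fraction

    def contains(self, value) -> bool:
        return self.lo <= value <= self.hi


def readout_interval(reading: str) -> Interval:
    """Interval of values consistent with a decimal read-out.

    Assumption of this sketch: the instrument rounds to the last displayed digit.
    """
    decimals = len(reading.split(".")[1]) if "." in reading else 0
    half_step = Fraction(1, 2 * 10 ** decimals)
    centre = Fraction(reading)  # exact value of the displayed decimal
    return Interval(centre - half_step, centre + half_step)


def yes_no_experiment(value, S: Interval) -> str:
    """Return "yes" if the value lies in S and "no" otherwise; nothing more is revealed."""
    return "yes" if S.contains(value) else "no"


S = readout_interval("1.520")
print(S.lo, S.hi)  # prints 3039/2000 3041/2000, i.e. the interval [1.5195, 1.5205]
```

Note that the function `yes_no_experiment` deliberately returns only the answer, in keeping with the requirement that the experiment gives no further information about where in S the value lies.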

To clarify the interpretation of Definition 1.1.3, we give another example. Let's 
say we measure the x-coordinate of a given particle in some physical system (quan- 
tum or classical) and we obtain the interval [a, b]. Then we view this as the yes/no 
experiment "Does the x-coordinate of the given particle lie in [a, b]?" performed on
the system, and that it resulted in a "yes". Similarly for any other observable of a 
system, and any Borel set S instead of [a, b]. Hence we can view a measurement of 
any observable as a yes/no experiment. 

Since Definition 1.1.3 is stated for arbitrary Borel sets S, rather than just the 
special case of intervals, it covers a much wider class of situations than the examples 
above. For example, instead of an interval [a, b], an experiment might give us some
union of possibly unbounded intervals. We can mention that since we will use 
measure theory in any case (especially when dealing with classical mechanics), the 
introduction of Borel sets at this stage does not cause any extra effort later on. 

We now want to show how the yes/no experiments can themselves be viewed 
as observables. Consider any property that the system may or may not have that 
can be verified or negated by a verifiable measurement (in the sense of Definition 
1.1.1) which results in the value 1 if the system has this property, and the value 0
otherwise. Then we can view this property as an observable which can have the
value 1 or 0. Now consider the yes/no experiment resulting in a "yes" if the value of 
the observable is 1 (i.e. the value lies in some Borel set containing 1 but not 0, for 
example {1}), and a "no" otherwise. Then the observable and the yes/no experiment 
are really one and the same thing, with the yes/no experiment merely relabelling 
the values 1 and 0 as "yes" and "no" respectively. An example of a property as
discussed here is "The x-coordinate of the particle lies in $[x_1, x_2]$, the y-coordinate
in $[y_1, y_2]$ and the z-coordinate in $[z_1, z_2]$" for a given particle in a physical system
(where in this example we use Cartesian coordinates). 

It should therefore now be clear that the observable in Definitions 1.1.1 and 
1.1.3 may be some property (as above) comprising a combination of other observables
together with sets in which their values might lie, such as the three position
coordinates (each an observable) and the three intervals in the last example. The 
point we are trying to make is that Definition 1.1.3 is very general, applying to any 
property as discussed above. It must be stressed though, that the verifiability of 
the measurement of such a property is essential here. Say for example we consider 
the property "The particle's position lies in $[s_1, s_2]$, and its momentum in $[p_1, p_2]$"
of a system consisting of a single particle confined to a straight line. In the case 
of quantum mechanics this property is not an observable in the sense described 
above, since it turns out that if we measure the position and momentum, and then 
immediately measure them again, their values need not be the same as during the 
first measurement, that is to say the measurement of the property is not verifiable. 
This "odd" behaviour is the classic example of how quantum mechanics differs from 
classical mechanics, since in the latter this property is in fact an observable. It is 
usually expressed as saying that the position and the momentum can not be mea- 
sured simultaneously (or that the two observables are not compatible). In principle 
we can study this type of behaviour for a property constructed from an arbitrary 
set of observables of a physical system. In Sections 1.2 to 1.6 we will see that the 
only real difference between quantum and classical mechanics is that the latter is 
commutative (or abelian) while the former is not (the meaning of this will become 
clear in Sections 1.2 to 1.6). Therefore the noncommutativity of quantum mechanics 
must be responsible for its "odd" behaviour as compared to classical mechanics. 

1.1.5 Remark. The idea of yes/no experiments (and their projections; refer to 
Section 1.2) can be traced back to [vNl], where yes/no experiments are viewed as 
"propositions" stating various possible properties of the system, a property being 
verified if we obtain a "yes" in the corresponding yes/no experiment. In classical 
mechanics the first hint at yes/no experiments seems to be [vN2] where von Neumann
asks the question "Does $P$ belong to $\theta$ or not?", $P$ being the pure state of the
system as a point in the phase space, and $\theta$ a measurable set in the phase space.
(We will return to this very question in Section 1.3, but in terms of Definition 1.1.3 
and its interpretation explained above.) The idea was further developed in [BvN] 
for both quantum and classical mechanics. ■ 

1.2 Quantum mechanics 

Let's look at the C*-algebraic formulation of quantum mechanics (also see [Ha]). 

Consider any quantum mechanical system. We represent the observables of the
system by a unital C*-algebra $\mathfrak{A}$, called the observable algebra of the system, and
the state of the system by a state $\omega$ on $\mathfrak{A}$, that is to say $\omega$ is a normalized positive
linear functional on $\mathfrak{A}$. (By normalized we mean that $\omega(1) = 1$, and by positive that
$\omega(A^*A) \geq 0$ for all $A \in \mathfrak{A}$.) At this stage we attach the intuitive meaning to the term "the
state of the system"; we will return to this in Section 1.4. The algebra $\mathfrak{A}$ contains the spectral
projections of the system's observables rather than the observables themselves. By
this we mean the following: To any yes/no experiment that we can perform on the
system, there corresponds a projection $P$ in $\mathfrak{A}$ such that $\omega(P)$ is the probability of
getting a "yes" during the experiment for any state $\omega$ of the system. We will refer
to $P$ as the projection of the yes/no experiment.

We will only consider yes/no experiments for which the experimental setup is
such that at least in the case of a "yes" the system survives the experiment (for
example, it is not absorbed by a detector), so further experiments can be performed
on it. What does the system's state look like after such an experiment? Consider
for the moment the Hilbert space setting for quantum mechanics. Here the (pure)
states of a system are represented by non-zero vectors, called state vectors, in a
Hilbert space $\mathfrak{H}$, called the state space of the system. Suppose the state is given
by the unit vector $x$ in $\mathfrak{H}$. After a yes/no experiment the state is given by the
projection of $x$ on some Hilbert subspace of $\mathfrak{H}$. Denoting the projection operator
onto the subspace in case of a "yes" by $Q$, we see that the system's state after
the experiment would then be given by the unit vector $Qx/\|Qx\|$, according to
the projection postulate ("collapse of the wave function"). It is clear that $Q$ is the
projection of the experiment, since $\theta(Q) = \langle x, Qx \rangle = \|Qx\|^2$ is exactly the probability of
getting a "yes". (Here the state $\theta$ on the C*-algebra $\mathfrak{L}(\mathfrak{H})$ of all bounded linear
operators on $\mathfrak{H}$, given by $\theta(A) = \langle x, Ax \rangle$, is the C*-algebraic representation of the
state $x$, in the sense of $\omega$ above, with $\mathfrak{L}(\mathfrak{H})$ serving as the observable algebra.)

Returning to our system with observable algebra $\mathfrak{A}$, we know by the GNS-construction
(see Section 2.2, or for example [BR, Section 2.3.3]) that there exists
a (cyclic) representation of $(\mathfrak{A}, \omega)$, namely a Hilbert space $\mathfrak{H}$, a *-homomorphism
$\pi : \mathfrak{A} \to \mathfrak{L}(\mathfrak{H})$ and a unit vector $\Omega$ in $\mathfrak{H}$, such that

\[ \omega(A) = \langle \Omega, \pi(A)\Omega \rangle \tag{2.1} \]

for all $A$ in $\mathfrak{A}$. This looks like the usual expression for the expectation value of an
observable (here represented by $\pi(A)$) for a system in the state $\Omega$ in the Hilbert space
setting (compare $\theta$ above). On a heuristic level we therefore regard $\mathfrak{H}$ as the state
space of the system, and $\Omega$ as its state. Say the result of the yes/no experiment
with projection $P$ is "yes". On the basis of the Hilbert space setting described
above, it would now be natural to expect that after the experiment the state is
represented by the unit vector $\Omega' = \pi(P)\Omega/\|\pi(P)\Omega\|$, since $\pi(P)$ is the projection
of the experiment in the Hilbert space setting in the same way as $Q$ above (and
hence $\pi(P)$ here plays the role of $Q$). Note that $\|\pi(P)\Omega\|^2 = \omega(P) > 0$ since this is
exactly the probability of getting the result "yes". We now replace $\Omega$ in (2.1) by $\Omega'$
to get a new expectation functional $\omega'$ defined by

\[ \omega'(A) = \langle \Omega', \pi(A)\Omega' \rangle \]

for all $A$ in $\mathfrak{A}$. Since $\pi$ is a *-homomorphism and $\pi(P)$ is self-adjoint, we have
$\omega'(A) = \omega(PAP)/\omega(P)$, so $\omega'(1) = 1$, which implies that $\omega'$
is a state on $\mathfrak{A}$. Based on these arguments we give the following postulate:



1.2.1 Postulate. Consider a quantum mechanical system in the state $\omega$ on its
observable algebra $\mathfrak{A}$. Suppose we get a "yes" during a yes/no experiment performed
on the system. After the experiment the state of the system is then given by the state
$\omega'$ on $\mathfrak{A}$ defined by

\[ \omega'(A) = \omega(PAP)/\omega(P) \tag{2.2} \]

for all $A$ in $\mathfrak{A}$, where $P$ is the projection of the yes/no experiment.



Suppose the state is expressed in terms of a density operator $\rho$ on a Hilbert space
$\mathfrak{H}$, namely $\omega(A) = \mathrm{Tr}(\rho A)$ for any bounded linear operator $A$ on the Hilbert space.
(Here density operator refers to a positive operator $\rho \in \mathfrak{L}(\mathfrak{H})$ with $\mathrm{Tr}(\rho) = 1$.) From
Postulate 1.2.1 it then follows that after the experiment the density operator is given
by

\[ \rho' = \frac{P \rho P}{\mathrm{Tr}(\rho P)} \tag{2.3} \]

in the case of a "yes". This is sometimes referred to as the Lüders rule (see [Hu, p.
274] or [Lu]), and by the arguments above we see that this rule can be viewed as
the projection postulate applied to a vector in a "bigger" Hilbert space, in which $\rho$
is represented by this vector. The equivalence of (2.2) and (2.3), assuming we only
consider states given by density operators, follows from the fact that if $\mathrm{Tr}(\rho_1 A) =
\mathrm{Tr}(\rho_2 A)$ for all $A \in \mathfrak{L}(\mathfrak{H})$ for two density operators $\rho_1$ and $\rho_2$ on $\mathfrak{H}$, then setting
$A = \rho_1 - \rho_2$ gives

\[ \|(\rho_1 - \rho_2)^2\|_1 = \mathrm{Tr}((\rho_1 - \rho_2)^2) = 0, \]

where $\|\cdot\|_1$ denotes the trace-class norm; see [Mu, p. 63 and 65]. Hence $(\rho_1 - \rho_2)^2 = 0$
and therefore $\|\rho_1 - \rho_2\|^2 = \|(\rho_1 - \rho_2)^2\| = 0$, where $\|\cdot\|$ denotes the usual operator
norm. So $\rho_1 = \rho_2$, proving the equivalence, namely that $\rho'$ is the unique density
operator ensuring that $\omega'(A) = \mathrm{Tr}(\rho' A)$ satisfies (2.2).
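
As a numerical illustration of the equivalence of (2.2) and (2.3), the following minimal sketch works on the two-dimensional Hilbert space with real matrix entries and exact rational arithmetic. The particular density operator, projection and observable are arbitrary choices made for this example, and the helper names (`state_from_density`, `update_state`, `luders`) are hypothetical. The last check shows the verifiability of the experiment: repeating it immediately gives a "yes" with probability one.

```python
from fractions import Fraction as F


def matmul(A, B):
    """Product of two matrices given as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def trace(A):
    return sum(A[i][i] for i in range(len(A)))


def state_from_density(rho):
    """The state omega(A) = Tr(rho A)."""
    return lambda A: trace(matmul(rho, A))


def update_state(omega, P):
    """State after a "yes": omega'(A) = omega(PAP) / omega(P), as in (2.2)."""
    p_yes = omega(P)
    if p_yes == 0:
        raise ValueError('a "yes" has probability zero in this state')
    return lambda A: omega(matmul(matmul(P, A), P)) / p_yes


def luders(rho, P):
    """Density operator after a "yes": P rho P / Tr(rho P), as in (2.3)."""
    p_yes = trace(matmul(rho, P))
    if p_yes == 0:
        raise ValueError('a "yes" has probability zero in this state')
    return [[entry / p_yes for entry in row] for row in matmul(matmul(P, rho), P)]


# Illustrative data (assumptions of this sketch).
rho = [[F(3, 4), F(1, 4)], [F(1, 4), F(1, 4)]]  # positive, trace one
P = [[F(1), F(0)], [F(0), F(0)]]                # a projection
A = [[F(2), F(1)], [F(1), F(5)]]                # a self-adjoint element
identity = [[F(1), F(0)], [F(0), F(1)]]

omega = state_from_density(rho)
omega_after = update_state(omega, P)
rho_after = luders(rho, P)

assert omega(P) == F(3, 4)                                    # probability of a "yes"
assert omega_after(identity) == 1                             # omega' is normalized
assert omega_after(A) == state_from_density(rho_after)(A)     # (2.2) agrees with (2.3)
assert omega_after(P) == 1                                    # repeating gives "yes" again
```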

Lastly we mention that the time-evolution of the system is described by a one-parameter
*-automorphism group $\tau$ of $\mathfrak{A}$, so if the projection of a yes/no experiment
is $P$ at time 0, then at time $t$ the projection of the same yes/no experiment will be
$\tau_t(P)$.

1.3 Classical mechanics

Now we turn to the C*-algebraic formulation of classical mechanics. We can represent
the pure state of a classical system by a point in its phase space $\mathbb{R}^{2n}$, where $n$ of
the entries are the generalized position coordinates, and the other $n$ their conjugate
momenta. This point is called the phase point of the system. This is somewhat
restrictive since such a point represents exact knowledge of the state of the system,
which is impossible in practice. Therefore we rather represent the state of
the system by a Borel measure $\mu$ on $\mathbb{R}^{2n}$ such that $\mu(S)$ is the probability that the
system's phase point is somewhere in the Borel set $S \subset \mathbb{R}^{2n}$. In particular we have
$\mu(\mathbb{R}^{2n}) = 1$.

We view each observable of the system as a Borel function $f : \mathbb{R}^{2n} \to \mathbb{R}$. This
simply means that if the system's phase point is $x \in \mathbb{R}^{2n}$, then the value of the
observable is $f(x)$. If we perform a yes/no experiment to determine if $f$'s value lies
in the Borel set $V \subset \mathbb{R}$, then the probability of getting "yes" is clearly

\[ \mu(f^{-1}(V)) = \int \chi_{f^{-1}(V)} \, d\mu \]

where $\chi$ denotes characteristic functions (i.e. for any set $A$, the function $\chi_A$ assumes
the value 1 on $A$, and zero everywhere else). We can view $\chi_{f^{-1}(V)}$ as a spectral
projection of the observable $f$, and we will refer to it as the projection of the yes/no
experiment, just as in the quantum mechanical case. Note that $\chi_{f^{-1}(V)}$ is a projection
in the C*-algebra $B_\infty(\mathbb{R}^{2n})$ of all bounded complex-valued Borel functions on
$\mathbb{R}^{2n}$, where the norm of $B_\infty(\mathbb{R}^{2n})$ is the sup-norm, its operations are defined pointwise,
and its involution is given by complex conjugation (we will use the *-algebraic
notation $g^* = \overline{g}$ for the complex conjugate of a complex-valued function $g$). We can
define a state $\omega$ on the C*-algebra $B_\infty(\mathbb{R}^{2n})$ by

\[ \omega(g) = \int g \, d\mu \tag{3.1} \]

for all $g$ in $B_\infty(\mathbb{R}^{2n})$. Then we see that the probability of getting a "yes" in the
above-mentioned yes/no experiment is $\omega(\chi_{f^{-1}(V)})$. So we can view $\omega$ as representing
the state of the system in exactly the same way as in quantum mechanics, where
now $B_\infty(\mathbb{R}^{2n})$ is the unital C*-algebra representing the observables of the system.
For this reason we call $B_\infty(\mathbb{R}^{2n})$ the observable algebra of the system.

Postulate 1.2.1 then holds for the classical case as well, as we now explain. Let 
S C M^" be a Borel set. The probability for the system's phase point to be in 
both S and f~^{V) is merely the probability for it to be in S* fl f~^{V), which is 
n{S r\ f~^{V)). A "yes" in the above mentioned yes/no experiment would mean 
that the system's phase point is in f~^{V), and the probability of this is n{f~^{V)). 
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Denote by the so-called conditional probability that the system's phase point 

is in S, given that the phase point is in f~^{V). Hence we should have 



It follows that if a "yes" is obtained in the experiment, then we can describe the 
system's state after the experiment by the measure /i' given by 



for all Borel sets S C M^". It is easily verified that /x' is indeed a Borel measure on 
M^". As for the case of fi and u; in (3.1), fi' corresponds to the state lo' on Soo(M^") 
given by 



(the second equality follows using standard measure theoretic arguments, i.e. first 
prove it for g a characteristic function and then use Lebesgue convergence; refer to 
[Rud]). This is exactly what Postulate 1.2.1 says if we replace the word "quantum" 
by "classical". 

For the time-evolution of a classical system we need the concept of a flow. Con- 
sider a measure space (X, where is a measure defined on a cr-algebra E 
of subsets of the set X. A flow on {X, E, ^) is a mapping i i— > on IR with the 
following properties: Tt is a function defined on X to itself, Tq is the identity on X 
(i.e. To(,t) = x), TgoTt = Tg+t, and Tt[S) e E and ii{Tf{S)) — IJ>{S) for all S in E. 
We denote this fiow simply by Tj. 

The time-evolution of our classical system is given by a fiow Tt on (M^",;B, A), 
where B is the cr-algebra of Borel sets of R^", and A is the Lebesgue measure on R^". 
Note that this statement contains Liouville's theorem, namely X{Tt(S)) = X{S) for 
all S in B. We call Tt the Hamiltonian flow. It simply means that if at time the 
system's phase point is x G M^", then at time t its phase point is Tt{x). 

$\mu(f^{-1}(V))\,\mu'(S) = \mu(S \cap f^{-1}(V))$ (3.2)

$\mu'(S) = \mu(S \cap f^{-1}(V)) / \mu(f^{-1}(V))$

As in the C*-algebraic approach to quantum mechanics, we want the time-evolution
to act on the observable algebra rather than on the states. Suppose the
system's phase point is $x$ at time 0. Consider an observable given by the function $f$
at time 0. Then the value of the observable at time 0 is $f(x)$, and hence at time $t$ its
value must be $f(T_t(x)) = (f \circ T_t)(x)$, where on the left hand side of the equation the
time-evolution is applied to the phase point, and on the right hand side it is applied
to the observable. So it is clear that an observable given by $f$ at time 0 will be
given by $f \circ T_t$ at time $t$ if the time-evolution acts on the observables rather than on
the states (this is the well-known Koopman construction, [Ko]). This is equivalent
to the action of $T_t$ on the spectral projections of $f$, since $\chi_{(f \circ T_t)^{-1}(V)} = \chi_{f^{-1}(V)} \circ T_t$
for all Borel sets $V \subset \mathbb{R}$. We explain the meaning of this in more detail: Suppose
the state of the system is described as in (3.1). At time $t$ we perform the yes/no
experiment "Does the value of the given observable lie in $V$?". Let's say that $x \in \mathbb{R}^{2n}$
is the phase point of the system at time 0. The value of the observable is in $V$ at
time $t$ if and only if $f(T_t(x)) \in V$, in other words if and only if $x \in (f \circ T_t)^{-1}(V)$.
The probability for this being the case (in other words, the probability of getting a
"yes" in the experiment) is

$\mu((f \circ T_t)^{-1}(V)) = \omega(\chi_{(f \circ T_t)^{-1}(V)}) = \omega(\chi_{f^{-1}(V)} \circ T_t)$

as explained at the beginning of this section. This means that at time $t$ the projection
of the yes/no experiment is given by $\chi_{f^{-1}(V)} \circ T_t$. It is easily seen that if we define
$\tau$ by

$\tau_t(g) = g \circ T_t$ (3.3)

for all $g$ in $B_\infty(\mathbb{R}^{2n})$, then $\tau$ is a one-parameter *-automorphism group of the
C*-algebra $B_\infty(\mathbb{R}^{2n})$. So the time-evolution is described in exactly the same way as in
quantum mechanics when viewed in C*-algebraic terms.
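
As a minimal numerical sketch of the Koopman construction (3.3), assume one degree of freedom and the harmonic oscillator Hamiltonian $H = (q^2 + p^2)/2$, whose flow is a rotation of the phase plane. This Hamiltonian and the names `T`, `tau`, `chi` and `in_V` are illustrative assumptions of ours, not part of the text. The code checks, at a single phase point, the group property, multiplicativity, and the spectral projection identity used above.

```python
import math

def T(t, x):
    """Flow of H = (q^2 + p^2)/2 (an assumed example): rotate the phase point (q, p)."""
    q, p = x
    c, s = math.cos(t), math.sin(t)
    return (c * q + s * p, -s * q + c * p)

def tau(t, g):
    """Equation (3.3): tau_t(g) = g o T_t, acting on observables instead of phase points."""
    return lambda x: g(T(t, x))

def chi(in_set):
    """Characteristic function of the set {x : in_set(x)}."""
    return lambda x: 1.0 if in_set(x) else 0.0

f = lambda x: x[0]                # observable: position q
g = lambda x: x[0] * x[1]         # observable: q * p
in_V = lambda v: 0.0 <= v < 1.0   # the Borel set V = [0, 1)

x0 = (0.3, -1.2)                  # phase point at time 0
s, t = 0.4, 1.1

# Group property: tau_s(tau_t(g)) = tau_{s+t}(g).
group_lhs = tau(s, tau(t, g))(x0)
group_rhs = tau(s + t, g)(x0)

# Multiplicativity: tau_t(f g) = tau_t(f) tau_t(g).
prod_lhs = tau(t, lambda x: f(x) * g(x))(x0)
prod_rhs = tau(t, f)(x0) * tau(t, g)(x0)

# Spectral projections: chi_{(f o T_t)^{-1}(V)} = chi_{f^{-1}(V)} o T_t.
proj_lhs = chi(lambda x: in_V(tau(t, f)(x)))(x0)
proj_rhs = tau(t, chi(lambda x: in_V(f(x))))(x0)

print(abs(group_lhs - group_rhs) < 1e-12,
      abs(prod_lhs - prod_rhs) < 1e-12,
      proj_lhs == proj_rhs)
```

Checking at one point is of course no proof; the sketch only makes concrete what it means for the time-evolution to act on observables rather than on phase points.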

We have now obtained a C*-algebraic formulation of classical mechanics. Note
that $B_\infty(\mathbb{R}^{2n})$ is an abelian C*-algebra. Replacing $B_\infty(\mathbb{R}^{2n})$ by an arbitrary abelian
unital C*-algebra would give us an abstract C*-algebraic formulation of classical
mechanics. From our discussion above it is clear that if in the C*-algebraic formulation
of quantum mechanics described in Section 1.2 we assume that $\mathfrak{A}$ is abelian, then
we get exactly this abstract C*-algebraic formulation of classical mechanics. Setting
$\mathfrak{A} = B_\infty(\mathbb{R}^{2n})$ would make it concrete. In this sense the C*-algebraic formulation of
quantum mechanics actually contains classical mechanics as a special case.

1.3.1 Remark. Here we used $B_\infty(\mathbb{R}^{2n})$ as the classical observable algebra. Other
choices are possible in certain approaches to statistical mechanics, for example
some C*-algebra of continuous functions on the phase space (see for example [Rue,
Section 7.1]), but in general this precludes projections and will therefore not do for
our purposes. ■

1.4 The general structure of mechanics 

We now summarize our work thus far to gain some perspective. 

In a mathematical description of a physical system (quantum or classical), we 
need to describe four things: 

(a) The observables of the system (as defined in 1.1.1). 

(b) The state of the system, by which we mean the observer's information
regarding the system. (We assume that the observer knows what the system is, i.e. he
knows what the observables are.) The case of maximal information is called a pure
state. We can say that by definition the state of the system is a mathematical object
which for each possible outcome of each measurement that can be performed on the
system, provides the observer with the probability for obtaining that outcome when
performing that measurement. We can then also say that the observer's information
about the system is by definition this state. (Note that the state of the system is
not an objective property of the system, but depends on the observer.) The state of
the system must be constructed from data gained during measurements previously
performed on the system. Of course, we have to assume that the measurements are
accurate (i.e. the data is correct, also see Remark 1.1.2), even though they may not
be precise (i.e. the data is incomplete). For example, when we measure the position
of a classical particle we get a set of possible values rather than a single value, but
the value of the position during the measurement is contained in this set.

(c) The measuring process. This is clearly closely connected to (a) and (b),
since the observables are exactly that which is measured, while the result of a
measurement gives the observer new information regarding the system, that is to say a
measurement changes the state. We can view all measurements of the observables
as yes/no experiments, as explained in Section 1.1.

(d) The time-evolution of the system (dynamics). In other words, how the prob- 
abilities mentioned in (b) change as we move forward (or backward) in time. 

The results of Sections 1.2 and 1.3 (for a quantum or classical mechanical system) 
are: 

(i) We describe the observables by an observable algebra $\mathfrak{A}$ which for each point
in time contains a projection corresponding to each yes/no experiment that can be
performed on the system (at that point in time). (These projections are referred to
as spectral projections.) $\mathfrak{A}$ is taken as a unital C*-algebra.

(ii) The state of the system is described by a state $\omega$ on $\mathfrak{A}$ (in the C*-algebraic
sense defined in Section 1.2), such that for every yes/no experiment, $\omega(P)$ is the
probability of getting "yes", where $P$ is the projection of the yes/no experiment at
the time at which it is performed. (Obviously this implies that the probability of
getting "no" is $1 - \omega(P) = \omega(1 - P)$.)

(iii) Regarding the measurement process we just have to describe how the state
is changed by a yes/no experiment. This is given by Postulate 1.2.1, which also
holds for a classical mechanical system as explained in Section 1.3. That is to say,
if a "yes" is obtained in the yes/no experiment, then after the experiment the state
of the system is given by the state $\omega'$ on $\mathfrak{A}$ defined by

$\omega'(A) = \omega(PAP)/\omega(P)$

for all $A$ in $\mathfrak{A}$, where $P$ is the projection of the yes/no experiment. (We will have
more to say about the measuring process in the next two sections.)

(iv) The time-evolution is given by a one-parameter *-automorphism group $\tau$ of
$\mathfrak{A}$, such that if at time 0 the projection of a given yes/no experiment is $P$, then at
time $t$ the projection of the same yes/no experiment will be $\tau_t(P)$. (The choice of
when time 0 is, is arbitrary, since $\tau$ is a group.)

This is the general structure of mechanics. As will be discussed in more detail in 
the next two sections, this general structure is nothing more than probability theory 
(actually, it is a noncommutative generalization of classical probability theory). It is 
a mathematical framework for dealing with information. When applied to a physical 
system, this information is the observer's information regarding the system, in other 
words, the system's state. 
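
Items (ii) and (iii) can be sketched in a finite-dimensional model, where we assume (for illustration only) that the observable algebra consists of $n \times n$ matrices and that states are given by density matrices via $\omega(A) = \mathrm{Tr}(\rho A)$. The helper names `mul`, `state`, `condition` and `diag`, and all numerical values, are hypothetical. With diagonal matrices the algebra is abelian and (iii) reduces to the usual conditional probability; with a non-diagonal projection it does not.

```python
from fractions import Fraction as Fr

def mul(A, B):
    """Product of two square matrices given as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def state(rho):
    """State omega(A) = Tr(rho A) for a density matrix rho (assumed model)."""
    def omega(A):
        M = mul(rho, A)
        return sum(M[i][i] for i in range(len(rho)))
    return omega

def condition(omega, P):
    """Item (iii): the state after a "yes", omega'(A) = omega(PAP) / omega(P)."""
    return lambda A: omega(mul(mul(P, A), P)) / omega(P)

def diag(*entries):
    """Diagonal matrix with the given entries."""
    n = len(entries)
    return [[entries[i] if i == j else Fr(0) for j in range(n)] for i in range(n)]

# Abelian case: three sample points with probabilities 1/2, 1/3, 1/6.
omega_c = state(diag(Fr(1, 2), Fr(1, 3), Fr(1, 6)))
event_A = diag(0, 1, 1)          # projection for the event {1, 2}
event_B = diag(1, 1, 0)          # projection for the event {0, 1}
classical = condition(omega_c, event_B)(event_A)   # p(A and B) / p(B)

# Noncommutative case: 2 x 2 matrices, P does not commute with Q.
h = Fr(1, 2)
omega_q = state(diag(Fr(1, 4), Fr(3, 4)))
P = [[h, h], [h, h]]             # projection onto the vector (1, 1)/sqrt(2)
Q = diag(1, 0)
before = omega_q(Q)              # probability of "yes" for Q before measuring P
after = condition(omega_q, P)(Q) # probability of "yes" for Q after a "yes" for P

print(classical, before, after)
```

Exact rational arithmetic is used so that the conditional probabilities can be compared without rounding.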

1.4.1 Remark on hidden variables. We have now seen that quantum and 
classical mechanics have the same general structure, from a probabilistic point of 
view, with classical mechanics being the special case where the observable algebra is 
abelian. Suppose that there is some classical theory underlying quantum mechanics 
(a hidden variable theory) and that quantum behaviour is the result of our ignorance
of these "hidden variables". A good guess would then be that this underlying
theory has the general structure given above, the observable algebra being abelian, 
where we lack precise information about the physical system being studied (also 
see [Ma, pp. 180-184] and references therein). But this fails to explain the non- 
commutative behaviour of quantum mechanics in a simple way. It would therefore 
seem that a hidden variable theory would be a complicated way of "explaining" the 
fact that quantum mechanics is simply a noncommutative generalization of classical 
probability theory. Hidden variables are then excised by Occam's razor. ■ 

1.4.2 Remark on spectral projections. For a quantum mechanical observable
represented by a (possibly unbounded) self-adjoint linear operator $A$ in the state
space $\mathfrak{H}$, the projection of the yes/no experiment "Is the value of $A$ in $V$?" can
be taken as the spectral projection $\chi_V(A)$ in terms of the Borel functional calculus
on self-adjoint operators; refer to [SZ, 9.9 to 9.13, and 9.32] for the construction
and properties of this calculus. Loosely speaking, this projection represents the part
of $A$ whose spectrum is contained in the Borel subset $V$ of $\mathbb{R}$. It is interesting to
note that this is very similar to the classical case in Section 1.3, where we used
$\chi_{f^{-1}(V)} = \chi_V \circ f$ instead of $\chi_V(A)$. We can write $\chi_V(f) := \chi_V \circ f$ to complete the
analogy, where more generally $g(f) := g \circ f$ defines a Borel functional calculus on the
measurable functions $f : F \to \mathbb{R}$ for Borel measurable $g : \mathbb{R} \to \mathbb{C}$. Here the classical
observable $f$ is also self-adjoint, namely $f^* := \bar{f} = f$, since it is real-valued. ■
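
The analogy in Remark 1.4.2 can be sketched as follows. On the classical side $g(f) := g \circ f$ is plain composition. On the quantum side we assume, purely for illustration, a $2 \times 2$ self-adjoint matrix with discrete spectrum whose spectral decomposition $A = \sum_i a_i P_i$ is written down by hand, and we set $g(A) := \sum_i g(a_i) P_i$. The names `chi`, `compose`, `apply` and `spectrum` are ours, and the sketch does not touch the unbounded case treated in [SZ].

```python
from fractions import Fraction as Fr

def chi(in_V):
    """Characteristic function chi_V of the set V = {v : in_V(v)}."""
    return lambda v: 1 if in_V(v) else 0

def compose(g, f):
    """Classical functional calculus: g(f) := g o f."""
    return lambda x: g(f(x))

# Classical observable on phase points (q, p), and a Borel set V = [1, 4].
f = lambda x: x[0] ** 2 + x[1] ** 2
in_V = lambda v: 1 <= v <= 4
chi_V_of_f = compose(chi(in_V), f)                # chi_V(f) = chi_V o f
chi_preimage = lambda x: 1 if in_V(f(x)) else 0   # chi_{f^{-1}(V)}

# Quantum observable A = [[0, 1], [1, 0]], decomposed by hand (assumed example):
# eigenvalue +1 with projection P_plus, eigenvalue -1 with projection P_minus.
h = Fr(1, 2)
P_plus = [[h, h], [h, h]]
P_minus = [[h, -h], [-h, h]]
spectrum = [(1, P_plus), (-1, P_minus)]

def apply(g, spectrum):
    """Functional calculus for a discrete spectrum: g(A) = sum_i g(a_i) P_i."""
    n = len(spectrum[0][1])
    return [[sum(g(a) * P[i][j] for a, P in spectrum) for j in range(n)]
            for i in range(n)]

A = apply(lambda a: a, spectrum)            # recovers A itself
E = apply(chi(lambda a: a > 0), spectrum)   # spectral projection chi_V(A), V = (0, oo)

points = [(0.5, 0.5), (1.0, 1.0), (2.0, 1.5)]
print([chi_V_of_f(x) == chi_preimage(x) for x in points], A, E == P_plus)
```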

1.5 Measurements and conditional probabilities

The quantum mechanical projection postulate for the state vectors, which we used 
in Section 1.2, often seems somewhat mysterious. However, within the general 
structure of mechanics it is quite natural, as we now explain. 

In Section 1.2 we extended this projection postulate to arbitrary states on an 
abstract observable algebra to obtain Postulate 1.2.1. This was done using a very 
natural heuristic argument based on the projection postulate for state vectors. In 
Section 1.3 we motivated Postulate 1.2.1 for a classical mechanical system by using 
the idea of a conditional probability. 

By a conditional probability we mean the probability for some event A to occur,
given the information that some event B has occurred. Denote this probability by
$p(A|B)$. Denote by $p(A)$ the probability for an event A to occur if no information
regarding occurrences of other events is available. Denote by $A \cap B$ the event where
the events A and B both occur. Then it is intuitively clear for any two events A
and B that

$p(B)\,p(A|B) = p(A \cap B)$. (5.1)

This is exactly what we used in equation (3.2).

To understand the intuition behind this, consider for example the case of a finite
number of (equally probable) sample points, say the six faces of a fair die. Let S be
the set of sample points (we call it the sample space), then events are represented
by subsets of S. (Hence the notation $A \cap B$ above; it is just the usual intersection
of sets.) Suppose S contains n points, and let A and B be events containing a and
b sample points respectively, while $A \cap B$ contains c sample points. Then

$p(A) = \frac{a}{n}$, $p(B) = \frac{b}{n}$ and $p(A \cap B) = \frac{c}{n}$. (5.2)

If we have the information that B occurred, then our sample space collapses to the
set B. Event A now consists of its sample points in B, in other words it is given by
$A \cap B$. Hence the probability of A is now

$p(A|B) = \frac{c}{b}$. (5.3)

From (5.2) and (5.3) we obtain (5.1).
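
The counting argument can be checked directly for the fair die. In this small sketch the events `A` ("even face") and `B` ("face at least 4") are arbitrary choices of ours, and `p` and `p_given` are hypothetical helper names implementing (5.2) and (5.3) by counting sample points.

```python
from fractions import Fraction

S = {1, 2, 3, 4, 5, 6}   # sample space: the six faces of a fair die
A = {2, 4, 6}            # event "even face" (assumed example)
B = {4, 5, 6}            # event "face at least 4" (assumed example)

def p(event):
    """Equation (5.2): probability as a ratio of numbers of sample points."""
    return Fraction(len(event), len(S))

def p_given(event, condition):
    """Equation (5.3): the sample space collapses to `condition`."""
    return Fraction(len(event & condition), len(condition))

# Equation (5.1): p(B) p(A|B) = p(A and B).
print(p(B) * p_given(A, B) == p(A & B))
```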

The same argument can be applied to the case where the sample space S is a flat
bounded surface with one of its points marked in some way, but we don't know which
point. Then the probability for a subset $A \subset S$ to contain the marked point is given
by (area of A)/(area of S), and hence A and B should be Lebesgue measurable.
So, the probability of an event is the "size" of the set representing the event. It is
essentially this measure theoretic idea that is used in Section 1.3, where the phase
point is the marked point.

Refer to [Fe] for more on probability, including sample spaces and conditional
probabilities.

In Section 1.3 we saw that in the case of classical mechanics, Postulate 1.2.1 is
simply another way of expressing (5.1) in the measure theoretic setting for
probability theory. Hence, in quantum mechanics, Postulate 1.2.1 can be viewed as a
"noncommutative conditional probability". (Also see [Bu].) So the "mysterious"
projection postulate of quantum mechanics is mathematically merely a noncommutative
extension of the conditional probability encountered in classical mechanics.
(Also see [Petz] for a short survey of the closely related idea of noncommutative
conditional expectations, or refer to [OP].) It should of course be kept in mind that
the physical consequences of the quantum projection postulate differ surprisingly
from those of classical mechanics, with the Uncertainty Principle as the archetypical
example (it essentially states that the position and momentum of a particle in one
dimension cannot be measured simultaneously, as was also mentioned in Section
1.1).

We can now formalize the idea of an ideal measurement (see Remark 1.1.2): 

1.5.1 Ideal measurements. Postulate 1.2.1 can be viewed as the definition of an
ideal measurement in quantum mechanics. Replacing the word "quantum" by
"classical", Postulate 1.2.1 defines an ideal measurement in classical mechanics. In short
one can say that an ideal measurement in mechanics is defined by (iii) in Section
1.4. So an ideal measurement is a change in the observer's information regarding
the system, via a (possibly noncommutative) conditional probability. (Note that by
"ideal" we do not mean "precise". In classical mechanics "ideal" means that the
system is not disturbed by the measurement. The same interpretation can be used
in quantum mechanics, as will be seen in Section 1.6.) ■

1.6 An interpretation of quantum mechanics 

There are several problems surrounding the interpretation of quantum mechanics, 
mainly involving the measuring process. What does the collapse of the wave func- 
tion mean? What causes it? And so on. In this section we argue that these prob- 
lems are essentially present in classical mechanics as well. In classical mechanics a 
measurement is nothing strange. It is merely an event where the observer obtains 
information about the system (we consider the case of an ideal measurement as in 
1.5.1). A measurement therefore changes the observer's information. One can then 
ask: What does the change in the observer's information mean? What causes it? 
And so on. These questions correspond to the questions above, but now they seem 
tautological rather than mysterious, since our intuitive idea of information tells us 
that the change in the observer's information simply means that he has received
new information, and the change is caused by the reception of the new information.
We will see that the quantum case is no different, except that the nature of infor- 
mation in quantum mechanics differs from that in classical mechanics. We now first 
describe the basic idea, and afterwards we show how it is actually an outgrowth of 
the mathematical framework we've been developing. 

Let's say an observer has information regarding the phase point of a classical system,
but not necessarily complete information (this is the typical situation discussed
in Sections 1.1 and 1.3). This information was of course obtained by measurements
the observer performed on the system (remember, by definition a measurement is 
the reception of information by the observer). Now the observer performs a mea- 
surement on the system to obtain new information (for example he might have infor- 
mation regarding a particle's position, now he measures the particle's momentum). 
The observer's information after this measurement then differs from his information 
before the measurement. In other words, a measurement "disturbs" the observer's 
information. 

In classical mechanics we know that an observer's information isn't merely dis- 
turbed, but is actually increased by a measurement (assuming the measurement 
provides new information). We will view this as an assumption regarding the na- 
ture of information which does not hold in quantum mechanics. On an operational 
level, this can be seen as the essential difference between quantum mechanics and 
classical mechanics: In both quantum and classical mechanics the observer's infor- 
mation is disturbed (changed) by a measurement if the measurement provides new 
information, but in classical mechanics the observer's information before the mea- 
surement is still valid after the measurement, while in quantum mechanics this is 
not necessarily the case. 

In Section 1.5 we saw that the projection postulate of quantum mechanics is 
essentially a noncommutative conditional probability which contains the classical 
conditional probability as a special case. In fact, the general structure of classi- 
cal mechanics described in Sections 1.3 and 1.4 is nothing more than probability 
theory (together with a time-evolution). One can shift the perspective somewhat 
by saying that this general structure is a probabilistic description of information. 
Since quantum mechanics has exactly the same general structure, except that
it is noncommutative, the mathematics seems to tell us that the general structure
of quantum mechanics is a probabilistic description of noncommutative information.
This noncommutative nature of information in quantum mechanics is what
causes the essential difference between quantum mechanics and classical mechanics 
mentioned above. (Also see [D3], on which this section is based.) 

1.6.1 Information. We can view (i)-(iv) of Section 1.4 as the abstract axioms for
a probabilistic description of information, where the information can be noncommutative.
Axiom (iii) is then a (noncommutative) conditional probability describing
how information changes when new data (the result of a measurement in the case
of physics) is received. Here we define information as being a state on an observable
algebra (or as the probabilities given by the state), with the information called
noncommutative if it changes via the noncommutative conditional probability. If we
were to add the assumption that the observable algebra is commutative, then we
get an abstract formulation of classical probability theory with the usual conditional
probability. The algebras $B_\infty(\mathbb{R}^{2n})$ (or more generally $B_\infty(F)$ for a phase space $F$;
see Remark 1.7.2) and $\mathfrak{L}(\mathfrak{H})$ are nothing more than convenient representations (of
the commutative and noncommutative cases respectively), suitable for doing physics
(in the way explained in Sections 1.2 and 1.3). ■

Interpreting quantum mechanics in this way implies that an (ideal) measurement 
disturbs the information regarding a system's state, rather than disturbing the sys- 
tem itself as is often argued (see for example [So, Section 1.6]). (In [I] a similar 
remark is made: "a measurement produces an uncontrollable disturbance in the 
potentiality for different results to be obtained in later measurements" (p. 165), but 
this remark becomes much clearer in the present setting in terms of information.) 
This then renders many problems surrounding measurements in quantum mechanics
no more difficult than in classical mechanics. The answer to both questions at
the beginning of this section is simply that the observer received new information
(i.e., the observer made a measurement), exactly as for the corresponding classical 
questions. (In particular this means that consciousness has no role to play in the 
measuring process. The observer could be a computer connected to a measuring 
instrument, or the measuring instrument itself, as long as it can receive information 
from the system.) We give a few more examples: 

1.6.2 The Heisenberg cut. This refers to an imaginary dividing line between the
observer and the system being observed (see for example [vN1] and [Ha]). It can
be seen as the place where information crosses from the system to the observer, but it
leads to the question of where exactly it should be; where does the observer begin?
In practice it's not really a problem: It doesn't matter where the cut is. It is merely 
a philosophical question which is already present in classical mechanics, since in 
the classical case information also passes from the system to the observer and one 
could again ask where the observer begins. The Heisenberg cut is therefore no more 
problematic in quantum mechanics than in classical mechanics. ■ 

1.6.3 When does the collapse of the wave function take place and how long does
it take? (See for example [Su, p. 212].) This is essentially the Heisenberg cut
with space replaced by time. One can pose the question as follows: When does an
observer "absorb" the information received from a measurement (i.e., when does the
measurement take place), and how long does it take? Again the quantum case is no
different from the classical case, and moreover, in practice it is no more of a problem
than in the classical case. ■

1.6.4 Continuous observation (see [Su] and [Ho]). The ideal measurement dis- 
cussed in Remarks 1.1.2 and 1.5.1 refers to a single measurement made at some 
point in time. It can therefore not be applied directly to continuous observation, 
i.e. when the observer's information is continually changing. However, in classical 
mechanics this is not considered a conceptual problem, since one could in princi- 
ple describe such a situation as a continual change in the probability distribution 
(probability measure) describing the information, even though it might be a diffi- 
cult technical problem in practice. The same is true in quantum mechanics, with 
the probability distribution replaced by a state representing noncommutative in- 
formation. (In quantum mechanics however, the idea of continuous observation is 
probably an idealization, for example watching something without blinking your 
eyes is not a continuous measurement, since the photons registered by your retina 
are discrete.) 

The "paradox of the watched pot that never boils" (called Zeno's paradox by 
[MS]) is resolved by noting that if an observer continuously measures a certain ob- 
servable, then the system can still evolve in time to produce other values for the 
observable if the measurement is not precise (as is typically the case). Say the ob- 
server measures an observable A which has a discrete spectrum, and he can only 
determine its value up to some interval containing (at a point in time) a number of 
eigenvalues of the observable, say $a_1, \ldots, a_n$. Then the state vector is projected onto
the subspace spanned by the eigenstates (at that point in time) corresponding to
$a_1, \ldots, a_n$, in other words, onto the subspace which at that point in time corresponds
to the interval (keep in mind that time-evolution acts on the observable algebra, and 
hence on the eigenstates of the observable). This happens according to postulate 
(iii); see for example [CDL, Section III.E.2.b]. To clarify our argument, we assume 
here that before the continuous measurement starts, the observer has maximal in- 
formation, i.e. his information is a state vector [the general case does not differ 
significantly, since it is still handled with the same projection postulate (iii)]. Note 
that the state is now still a state vector, and not a mixture of the eigenstates corresponding
to $a_1, \ldots, a_n$. The interval which is measured (and hence the eigenvalues
of A contained in it) can change in the course of time (for example it can drift up 
and down the real line), simply because of the lack of precision in the continuous 
measurement. Therefore the value of A can change within this drifting interval, in 
turn allowing the drifting interval's average location to change accordingly, which is 
what the observer sees. In the mathematics this looks as follows: The continuous 
measurement confines the state vector via the projection postulate to the "drifting" 
subspace corresponding to the drifting interval. The observable's eigenstates are 
evolving in time, but since this drifting subspace contains many eigenstates of the
observable at any point in time, the projection postulate does not cause the state
vector to be "dragged along" by one of the time-evolving eigenstates. Also, since
the interval is drifting, eigenstates are moving in and out of the subspace. Therefore
the state vector can be projected onto subspaces containing new eigenstates (cor- 
responding to new eigenvalues), with eigenstates brought closer to the state vector 
by time-evolution having higher probability. (This argument becomes somewhat 
clearer in the Schrodinger picture, where the eigenstates are fixed, but the subspace 
is still drifting.) 

If the continuous measurement is precise enough, then quantum mechanics indeed
predicts that "a watched pot never boils" if the observable's eigenvalues are
discrete (precise measurement of a continuous observable is impossible in practice).
This happens because a quantum measurement can invalidate previous informa- 
tion (i.e. the state vector can change by projection) which then "cancels out" the 
changes due to time-evolution acting on the observable algebra (and thus on the 
observable's eigenvectors onto which projection of the state vector occurs). In effect 
the state vector is dragged along by the time-evolving eigenstate corresponding to 
the measured value. In classical mechanics on the other hand, previous information 
is not invalidated by measurement, hence the values of observables can change as 
time-evolution acts on the observable algebra while the pure state of the system 
stays put. Note that this is true even if the classical observable being observed is
discrete (for example "number of particles in the left half of the container"). So no
matter how closely we watch a classical pot, it can still boil. ■
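
The "watched pot" effect for precise measurements can be illustrated numerically. The sketch below assumes a two-level system with Hamiltonian $H = \sigma_x$ (in units with $\hbar = 1$) that starts in the eigenstate $|0\rangle$ of the measured observable, and it works in the Schrödinger picture rather than in the Heisenberg picture used in the text. The system, the Hamiltonian and the names `evolve` and `survival_probability` are all illustrative assumptions. Each precise measurement that finds $|0\rangle$ projects the state vector back onto $|0\rangle$, as in postulate (iii).

```python
import math

def evolve(state, dt):
    """Schrodinger evolution exp(-i H dt) for the assumed Hamiltonian H = sigma_x."""
    a, b = state
    c, s = math.cos(dt), math.sin(dt)
    return (c * a - 1j * s * b, -1j * s * a + c * b)

def survival_probability(total_time, n_measurements):
    """Probability that all n equally spaced precise measurements find |0>."""
    state = (1.0 + 0j, 0j)                 # start in |0>
    dt = total_time / n_measurements
    prob = 1.0
    for _ in range(n_measurements):
        a, b = evolve(state, dt)
        p_yes = abs(a) ** 2                # probability of finding |0> now
        if p_yes == 0.0:
            return 0.0
        prob *= p_yes
        state = (a / abs(a), 0j)           # project onto |0> and renormalize
    return prob

T_total = math.pi / 2                      # unobserved, the system would be in |1> by now
unobserved = abs(evolve((1.0 + 0j, 0j), T_total)[1]) ** 2
for n in (1, 10, 100, 1000):
    print(n, survival_probability(T_total, n))
```

With a single measurement at the end the system has almost certainly left $|0\rangle$, while the survival probability approaches one as the measurements become more frequent. In this two-level model an imprecise measurement corresponds to the identity projection, which leaves the free evolution (`unobserved`) untouched.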

1.6.5 The EPR "paradox." Einstein, Podolsky and Rosen [EPR] described a now 
famous experiment in which two particles are created together (or interact) and 
then move away from each other (which ends any interaction between them) before 
a measurement is performed on one of the particles. This measurement then gives 
corresponding information about the other particle as well. [This is the result of an 
entanglement of the two particles' states (for example due to a conservation law), 
which can occur since the state space is the tensor product of the two particles' 
state spaces.] EPR argued that this means that the second particle simultaneously 
has values for two noncommuting observables like position and momentum, since 
only the first particle is measured (either its position or its momentum is measured, 
but not both), and hence quantum mechanics must be incomplete, since it says that
a particle does not simultaneously have values for position and momentum. They
based this on the idea that a measurement on the first particle does not disturb the 
second. However, we have viewed a measurement as the reception of information 
by the observer; it has nothing to do with the observer "directly" observing (and 
disturbing) the system. Measuring the first particle gives the observer information 
regarding the second particle as well (and hence is a measurement of the second 
particle), which is mathematically described by the second particle's state vector
(representing the observer's noncommutative information about this particle) now
being in an eigenspace of the observable which was measured. This is no different 
from the analogous situation in classical mechanics where for example conservation 
of momentum can give the second particle's momentum when the first particle's 
momentum is measured, except that in this case information is commutative. 
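
As an illustration of this argument, the sketch below uses the spin version of the experiment and assumes that the two spin-half particles are in the singlet state. It applies the projection postulate for "first particle up" and then reads off the probability of "second particle down", next to the analogous classical computation with an ordinary conditional probability. The basis ordering, the helper `project` and the classical joint distribution are our own illustrative choices.

```python
import math

# Basis order for the two spin-half particles: |uu>, |ud>, |du>, |dd>.
r = 1 / math.sqrt(2)
singlet = [0.0, r, -r, 0.0]

def project(psi, keep):
    """Project onto the span of the basis vectors with indices in `keep`.

    Returns the probability of "yes" and the normalized projected state.
    """
    projected = [amp if i in keep else 0.0 for i, amp in enumerate(psi)]
    prob = sum(abs(amp) ** 2 for amp in projected)
    return prob, [amp / math.sqrt(prob) for amp in projected]

A_UP = {0, 1}      # first particle up:    |uu>, |ud>
B_DOWN = {1, 3}    # second particle down: |ud>, |dd>

p_a_up, after_a = project(singlet, A_UP)
p_b_down_before, _ = project(singlet, B_DOWN)
p_b_down_after, _ = project(after_a, B_DOWN)

# Classical analogue: two perfectly anticorrelated values, each case equally likely.
joint = {("up", "down"): 0.5, ("down", "up"): 0.5}
p_first_up = sum(p for (a, b), p in joint.items() if a == "up")
p_second_down_given_up = sum(
    p for (a, b), p in joint.items() if a == "up" and b == "down") / p_first_up

print(p_a_up, p_b_down_before, p_b_down_after, p_second_down_given_up)
```

In both cases the observer's probability for the second particle changes from one half to one when the first particle is measured, without anything acting on the second particle.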

We can even have two observers A and B measuring the same observable of the 
two particles respectively (as in [I] for example). A's measurement is then also 
a measurement of the value B will get (A receives information about what B's 
result will be) and so there's nothing strange in them getting correlated results (say 
opposite values for momentum; or opposite values for spin z, where the particles 
have spin half as in Bohm's version of the EPR experiment, [Bo]). No signal need 
travel faster than the speed of light to B's particle to "tell" it to have the opposite 
value to A's result, in the same way that no such signal is needed in the classical 
case. From A's point of view, B is part of the system along with the two particles, 
and so this experiment is really no different from the original one-observer EPR
experiment above. The particles along with B are in a superposition of states from
A's point of view until A measures his particle, which reduces (by projection) the
state vector of the combined system of particles and B, with B then in the eigenspace
"B gets the opposite value". ■

1.6.6 System and observer as a combined system (see [I] for a clear exposition). 
Here the time-evolution of the combined system is supposed to account for the pro- 
jection postulate of quantum mechanics. This is not possible in a natural way, since 
time-evolution is the result of a one-parameter *-automorphism group. In classical 
mechanics the combined system evolves according to classical dynamics (the observer 
being thought of as a classical system in this case), and this then similarly would 
have to account for the change in the observer's information via a conditional prob- 
ability due to a measurement he performs on the system. Again this is not possible 
in a natural way, since here too we have the same projection postulate, namely the 
conditional probability (iii) in Section 1.4 acting on the state (of the system without 
observer), while the time-evolution acts as a one-parameter *-automorphism group 
on the observable algebra. The solution is that the state of the combined system 
has to contain from the start the fact that the observer will perform a measurement 
on the system at a given point in time and will subsequently experience a change 
of information (this change is a physical process in the observer, described by the 
combined system's time-evolution, for example some neural activity in a human ob- 
server's brain), otherwise such a measurement and the change of information would 
not take place. This is clear, since time-evolution does not act on the state, but on
the observable algebra, hence the state of the combined system is the state "for all 
time" and does not change when the observer performs a measurement. Exactly the 
same is true for quantum mechanics (where the observer is then also viewed as a
quantum system). The (noncommutative) conditional probability, that is to say the
projection postulate, is only relevant when the observer is not considered to be part
of the system, in which case the conditional probability says what the change in the
observer's information will be; it does not describe the physical process taking place
in the observer to accommodate (or store) the new information. ■

In connection with the two-slit experiment we mention the following: 

1.6.7 The two-slit experiment. Assume that the probability distribution for the 
position of detection of a particle on the screen in the two-slit experiment is given 
by an interference pattern when no measurement is performed at the two open slits 
(this is due to the wave nature of quantum particles, which is not accounted for 
by the abstract concept of noncommutative information (in 1.6.1) by itself, but 
rather follows from the specific form of dynamics of quantum mechanics). This 
distribution represents the observer's information about where on the screen the 
particle will be detected. In the light of our discussion thus far, it should then 
not be too surprising that this distribution (i.e. the observer's information) can be 
invalidated via the noncommutative conditional probability (iii) in Section 1.4, if 
the observer does measure through which slit the particle goes (i.e. if the observer 
receives new information), giving a completely different probability distribution at 
the screen. This is unlike the classical case where a measurement at the slits gives 
the observer more information, rather than invalidating previous information. (Also
see [Bu].) ■
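
The change in the screen distribution can be sketched with a toy model. We assume two equal-weight amplitudes that reach the screen position `x` from slits at $\pm d/2$, each picking up only a phase $e^{ikr}$ over its path length $r$. The wave number `k`, slit separation `d`, screen distance `L` and both function names are hypothetical, and the model ignores the falloff of the amplitudes with distance. Without a slit measurement the amplitudes add; after a slit measurement the state is projected onto one slit, and averaging over the two outcomes adds the probabilities instead.

```python
import cmath
import math

def amplitudes(x, k=20.0, d=1.0, L=10.0):
    """Amplitudes at screen position x from slits at +d/2 and -d/2 (toy model)."""
    r1 = math.hypot(L, x - d / 2)   # path length from the first slit
    r2 = math.hypot(L, x + d / 2)   # path length from the second slit
    return (cmath.exp(1j * k * r1) / math.sqrt(2),
            cmath.exp(1j * k * r2) / math.sqrt(2))

def density_no_measurement(x):
    """Relative detection density when nothing is measured at the slits."""
    a1, a2 = amplitudes(x)
    return abs(a1 + a2) ** 2

def density_with_slit_measurement(x):
    """Relative detection density averaged over the two slit-measurement outcomes."""
    a1, a2 = amplitudes(x)
    return abs(a1) ** 2 + abs(a2) ** 2

xs = [i * 0.01 for i in range(-300, 301)]
fringes = [density_no_measurement(x) for x in xs]
flat = [density_with_slit_measurement(x) for x in xs]
print(min(fringes), max(fringes), min(flat), max(flat))
```

The first density oscillates between nearly zero and two across the screen, while the second is constant in this model: the interference pattern is gone once the observer has received the which-slit information.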

The point we attempt to make with examples 1.6.2 to 1.6.6 is that, even though 
there might be certain problems surrounding the measuring process, quantum me- 
chanics does not introduce any new conceptual problems not already present in 
classical mechanics when one considers a single observer performing measurements 
on a physical system, as long as we assume that information is noncommutative in 
quantum mechanics. 

We can also consider the case of more than one observer touched upon in 1.6.5: 

1.6.8 Thought experiment. Say three observers A, B and C are observing the 
same system, but B and C are not aware of each other or of A. B and C measure two 
noncommuting observables P and Q respectively, in the order P, Q, P, and A in
turn measures B and C's results in this order (he "sees" each of their results at the
time they obtain them). We ignore the time-evolution of the system. Say the results
are $p_1$, $q$, $p_2$ (in this order), then clearly $p_1$ and $p_2$ need not be the same since P and
Q do not commute. So from B's point of view it seems that something disturbed 
the system between his two measurements of P. However, in our interpretation it is 
actually B's information that has been invalidated by A and C's measurement of Q. 
This is not too strange, since B and C are merely A's measuring instruments. One 
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could ask what would happen if A wasn't there. Would B then get pi ~ p2 with 
probability one? In the absence of A, does it even make sense to talk of the time order 
P, Q, P if B and C arc not aware of each other? In our interpretation time ordering 
should probably be viewed as in some way defined by information received by an 
observer, and in this case it seems possible that B would get pi = p2 with probability 
one in the absence of A and no other way to define the time ordering. (Note that 
in the two-slit experiment, for example, there is a time ordering in the sense that 
a measurement on a particle at the slits is performed before a measurement on the 
same particle at the screen, even if the measurements are performed by two different 
observers not aware of each other, so the interference pattern at the screen can still 
be destroyed in this setup.) The idea of defining time ordering in terms of a series 
of events (an event in our case being the reception of information by an observer) 
was introduced in [Filj.B 
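
The claim that the two results for P need not agree can be made concrete in the
smallest possible case. The sketch below is only an illustration under stated
assumptions: P and Q are modelled by the Pauli matrices sigma_z and sigma_x on a
two-dimensional Hilbert space (a choice not made in the text), time-evolution is
ignored as in 1.6.8, and the function names are hypothetical.

```python
import math

# Assumption: P = sigma_z and Q = sigma_x, given here by their eigenvectors.
P_EIGENVECTORS = [(1.0, 0.0), (0.0, 1.0)]
S = 1.0 / math.sqrt(2.0)
Q_EIGENVECTORS = [(S, S), (S, -S)]


def overlap_probability(u, v):
    """Probability |<u, v>|^2 of obtaining outcome u in the state v (real vectors)."""
    inner = u[0] * v[0] + u[1] * v[1]
    return inner * inner


def prob_same_p_result(measure_q_in_between):
    """Probability that the second P measurement repeats the first result.

    Right after the first P measurement the state is the eigenvector of P
    belonging to the result that was obtained.
    """
    start = P_EIGENVECTORS[0]
    if not measure_q_in_between:
        return overlap_probability(start, start)
    total = 0.0
    for q_vec in Q_EIGENVECTORS:
        # Q is measured first (state becomes q_vec), then P is measured again.
        total += overlap_probability(q_vec, start) * overlap_probability(start, q_vec)
    return total


if __name__ == "__main__":
    print(prob_same_p_result(measure_q_in_between=False))
    print(prob_same_p_result(measure_q_in_between=True))
```

With these choices the first probability is one and the second is one half, which is the
sense in which the intermediate measurement of Q invalidates B's information about P.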

We have now seen that the general structure of quantum mechanics as presented 
in Section 1.4 is essentially a mathematical framework for handling noncommutative 
information. Based on this, we make the following two remarks: 

1.6.9 The structure of spacetime. If we assume that information in our physical
world is described by quantum mechanics, then we are led to conclude that
information is actually a noncommutative phenomenon. Perhaps this means that
since information "lives" in spacetime (and possibly in some way defines spacetime
structure as was alluded to in 1.6.8), spacetime itself is noncommutative, as has
been suggested in attempts to construct quantum spacetime and quantum gravity; 
see for example [DFR]. (This opens the possibility that spacetime is discrete like 
many other quantum phenomena; see for example [Sm] for a popular account.) On 
the other extreme, the term "noncommutative information" may be a "purely gram- 
matical trick" of the sort [Ma, p. 188] mused might "be the ultimate solution of the 
quantum measurement problem"; this possibility seems somewhat less interesting 
however. ■ 

1.6.10 The linear structure of quantum mechanics. The general structure of
classical mechanics in Sections 1.3 and 1.4 is linear since it is nothing more than 
probability theory, even though it can be applied to physical systems where nonlinear 
aspects might be involved. It is the statistical point of view that makes everything 
linear (essentially this boils down to the use of averages, which are integrals and 
hence linear). The same goes for quantum mechanics. Its linear structure should 
not be viewed as an approximation to an underlying nonlinear world, but simply as 
a result of the fact that it is a mathematical framework for probability theory (i.e. 
statistics, averages), where the information involved happens to be noncommutative. 
The appearance of a Hilbert space as the state space is simply a mathematical way
of representing the algebraic structure in Section 1.4. So the linearity of (and hence
superpositions in) the state space is just a convenient way to express the fact that
a measurement can invalidate the information the observer had before the
measurement, or more precisely, to express noncommutative conditional probabilities. (Also
see [Fi2, p. 175] and [Ha, p. 309] for similar remarks concerning the linearity of
quantum mechanics.) ■

Fuchs and others have also argued convincingly that information theoretic ideas 
are of great importance for the foundations of quantum mechanics, in particular 
that a quantum state represents an observer's information rather than having an 
objective reality (see [FuP], [Fu] and [CFS]). Refer to [St] for a review of quantum 
mechanics viewed as a generalization of classical probability theory. 

We cannot claim that this "noncommutative information interpretation" solves 
all of the conceptual problems of quantum mechanics, but for the case of a physical 
system being observed by an observer not considered to be part of the system, it 
does seem to clarify many issues without causing any new problems (except if you 
consider the idea of noncommutative information itself to be a problem). 

1.7 A quantum analogue of Liouville's Theorem 

In Sections 1.2 to 1.4 we saw that in purely C*-algebraic terms, quantum mechanics
and classical mechanics are identical, except of course for the fact that the classical
observable algebra is abelian while this is not in general true for quantum mechanics.
This suggests that it might be possible to find a quantum mechanical analogue of
Liouville's Theorem, a search we pursue in this section for reasons explained in the
Introduction, and simply because it is an interesting possibility in its own right
(see Proposition 1.7.5 for the final result). Our first clue in this direction is the
following simple proposition (where for a $\sigma$-algebra $\Sigma$ in a set X, we denote by $B_\infty(\Sigma)$
the C*-algebra of all bounded complex-valued $\Sigma$-measurable functions on X, with
the sup-norm, its operations defined pointwise, and its involution given by complex
conjugation, as for the special case $B_\infty(\mathbb{R}^{2n})$ in Section 1.3):

1.7.1 Proposition. Let $(X, \Sigma, \mu)$ be a measure space with $\mu(X) < \infty$, and let
$T : X \to X$ be a mapping such that $T^{-1}(S) \in \Sigma$ for all $S \in \Sigma$. Define $\tau$ and $\varphi$ by
$\tau(g) = g \circ T$ and $\varphi(g) = \int g \, d\mu$ for all $g \in B_\infty(\Sigma)$. Then $\mu(T^{-1}(S)) \le \mu(S)$
for all $S \in \Sigma$, if and only if $\varphi(\tau(g)^* \tau(g)) \le \varphi(g^* g)$ for all $g \in B_\infty(\Sigma)$. Also,
$\mu(T^{-1}(S)) = \mu(S)$ for all $S \in \Sigma$ if and only if $\varphi(\tau(g)) = \varphi(g)$ for all $g \in B_\infty(\Sigma)$.

Proof. We use standard measure theoretic arguments (refer to [Rud]).
Suppose $\varphi(\tau(g)^* \tau(g)) \le \varphi(g^* g)$ for all $g \in B_\infty(\Sigma)$, then it holds in particular
for $g = \chi_S$, where $S \in \Sigma$, and so

$$\mu(T^{-1}(S)) = \varphi(\chi_S \circ T) = \varphi((\chi_S \circ T)^* \, \chi_S \circ T) = \varphi(\tau(\chi_S)^* \tau(\chi_S))$$
$$\le \varphi(\chi_S^* \chi_S) = \varphi(\chi_S)$$
$$= \mu(S).$$

Similarly for the case of equality.

Conversely, suppose $\mu(T^{-1}(S)) \le \mu(S)$ for all $S \in \Sigma$. This is equivalent to
having $\int \chi_S \circ T \, d\mu \le \int \chi_S \, d\mu$ for all $S \in \Sigma$. By Lebesgue's Monotone Convergence
Theorem this extends to all positive measurable functions, namely

$$\int f \circ T \, d\mu \le \int f \, d\mu$$

for positive $f \in B_\infty(\Sigma)$ by considering an increasing sequence $(f_n)$ of positive simple
measurable functions converging pointwise to $f$, since then $(f_n \circ T)$ is an increasing
sequence of positive simple measurable functions converging pointwise to $f \circ T$.
Setting $f = g^* g$ for any $g \in B_\infty(\Sigma)$, we obtain

$$\varphi(\tau(g)^* \tau(g)) = \int (g^* g) \circ T \, d\mu \le \int g^* g \, d\mu = \varphi(g^* g).$$

Similarly for the case of equality, and this then extends by linearity to $\varphi(\tau(g)) =
\varphi(g)$ for all $g \in B_\infty(\Sigma)$. ■

Consider a classical system whose phase point is confined to a Borel set F of
finite volume in the phase space $\mathbb{R}^{2n}$. That is to say $\lambda(F) < \infty$, where $\lambda$ is the
Lebesgue measure on $\mathbb{R}^{2n}$.

1.7.2 Remark. If the phase point is confined to a set $F \subset \mathbb{R}^{2n}$, then we can view
F as the phase space of the system (whether F has finite volume or not), taking
the $\sigma$-algebra $\Sigma$ of measurable sets in F as the intersections of the Borel sets of
$\mathbb{R}^{2n}$ with F. (In Section 1.3 we simply used the Borel sets of $\mathbb{R}^{2n}$ as the $\sigma$-algebra
of measurable sets in the phase space.) We then replace the Lebesgue measure by
its restriction to F (assuming F is Lebesgue measurable), and we use probability
measures on F, instead of on $\mathbb{R}^{2n}$. Also, the observables will be represented by
$\Sigma$-measurable functions on F, and the observable algebra will be $B_\infty(F) := B_\infty(\Sigma)$.
The whole of Section 1.3 can then be repeated with F in the place of $\mathbb{R}^{2n}$. ■

We define a measure $\nu$ on the Borel sets of $\mathbb{R}^{2n}$ by

$$\nu(S) = \lambda(S \cap F).$$

Using Proposition 1.7.1 we see that Liouville's theorem for this system can then be
expressed in C*-algebraic terms by stating that

$$\varphi(\tau_t(g)) = \varphi(g) \qquad (7.1)$$

for all $g$ in $B_\infty(\mathbb{R}^{2n})$, where $\tau$ is given by equation (3.3), and $\varphi(g) = \int g \, d\nu$ (so $\varphi$ is a
positive linear functional on $B_\infty(\mathbb{R}^{2n})$). This is because $\nu(T_{-t}(S)) = \lambda(T_{-t}(S) \cap F) =
\lambda(T_{-t}(S) \cap T_{-t}(F)) = \lambda(T_{-t}(S \cap F)) = \lambda(S \cap F) = \nu(S)$, since we have $T_t(F) \subset F$ for
all $t \in \mathbb{R}$ (the phase point is confined to F) and so $F \subset (T_{-t})^{-1}(F) = T_t(F)$, which
means that $T_t(F) = F$. Note that the condition $\mu(X) < \infty$ in Proposition 1.7.1
can be dropped if we only consider positive elements of $B_\infty(\Sigma)$. Hence (7.1) would
express Liouville's Theorem for systems not necessarily bounded in phase space if
we were to use $\lambda$ instead of $\nu$, and only consider positive elements $g$ of $B_\infty(\mathbb{R}^{2n})$. (In
this case $\varphi$ could assume infinite values, and it would not be a linear mapping on
$B_\infty(\mathbb{R}^{2n})$ any more.) We will only work with the bounded case though, since then
the measure can be normalized to give a probability measure, which is what we will
use when studying recurrence and ergodicity.
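
The equivalence in Proposition 1.7.1 between invariance of the measure and invariance
of the functional can be checked by hand on a finite toy example. The following Python
sketch is only an illustration: the four-point set, the counting measure and the two
maps (called rotation and collapse here) are assumptions chosen for the example, and on
a finite set integrals reduce to finite sums.

```python
from itertools import combinations


def preimage(T, S, points):
    """T^{-1}(S) for a map T given as a dict on a finite set of points."""
    return {x for x in points if T[x] in S}


def measure(mu, S):
    """mu(S) for a measure given by point weights."""
    return sum(mu[x] for x in S)


def phi(mu, g, points):
    """phi(g) = integral of g with respect to mu (a finite sum here)."""
    return sum(g(x) * mu[x] for x in points)


def preserves_measure(T, mu, points):
    """Check mu(T^{-1}(S)) == mu(S) for every subset S."""
    for r in range(len(points) + 1):
        for S in combinations(points, r):
            if abs(measure(mu, preimage(T, set(S), points)) - measure(mu, S)) > 1e-12:
                return False
    return True


def phi_invariant(T, mu, points):
    """Check phi(g o T) == phi(g) for the indicator function g of each point.

    On a finite set every function is a linear combination of these
    indicators, so by linearity this covers all g.
    """
    for s in points:
        g = lambda x, s=s: 1.0 if x == s else 0.0
        if abs(phi(mu, lambda x: g(T[x]), points) - phi(mu, g, points)) > 1e-12:
            return False
    return True


points = [0, 1, 2, 3]
mu = {x: 1.0 for x in points}             # counting measure
rotation = {0: 1, 1: 2, 2: 3, 3: 0}        # a permutation of the points
collapse = {0: 0, 1: 0, 2: 2, 3: 3}        # not injective

print(preserves_measure(rotation, mu, points), phi_invariant(rotation, mu, points))  # True True
print(preserves_measure(collapse, mu, points), phi_invariant(collapse, mu, points))  # False False
```

The permutation plays the role of a volume-preserving flow, and the two checks agree
for both maps, as the proposition requires.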

Since quantum mechanics has the same C*-algebraic structure as classical
mechanics, we now suspect that a quantum mechanical analogue of Liouville's Theorem
should have the same form as (7.1). Let's look at this from a different angle. In the
Hilbert space setting for quantum mechanics, the state space $\mathfrak{H}$ can be viewed as the
analogue of the classical phase space $\mathbb{R}^{2n}$. $\mathfrak{H}$ is a Hilbert space while we view $\mathbb{R}^{2n}$
purely as a measurable space. Apart from dynamics, we saw in Sections 1.2 to 1.4
that the central objects in both quantum and classical mechanics are the projections.
A projection defined on $\mathfrak{H}$ is equivalent to a Hilbert subspace of $\mathfrak{H}$ (namely the range
of the projection). A projection defined on $\mathbb{R}^{2n}$ is a Borel measurable characteristic
function, and is therefore equivalent to a Borel set in $\mathbb{R}^{2n}$. Liouville's Theorem is
based on the existence of a natural way of measuring the size of a Borel set in $\mathbb{R}^{2n}$,
namely the Lebesgue measure $\lambda$. We would therefore like to have a natural way of
measuring the size of a Hilbert subspace of $\mathfrak{H}$ in order to get a quantum analogue
of Liouville's Theorem. An obvious candidate is the (Hilbert) dimension dim. For
the Hamiltonian flow $T_t$, Liouville's Theorem states that $\lambda(T_{-t}(S)) = \lambda(S)$ for every
Borel set S. (We use $T_{-t}(S)$ instead of $T_t(S)$, since this corresponds to the action
of $T_t$ on the observable algebra rather than on the states, namely $\chi_S \circ T_t = \chi_{T_{-t}(S)}$.)
In the state space time-evolution is given by a one-parameter unitary group $U_t$ on
$\mathfrak{H}$, and for any Hilbert subspace $\mathfrak{K}$ of $\mathfrak{H}$ we have $\dim(U_t^* \mathfrak{K}) = \dim(U_{-t} \mathfrak{K}) = \dim(\mathfrak{K})$.
This is clearly similar to Liouville's theorem. For a finite dimensional state space we
will in fact view this as a quantum analogue of Liouville's Theorem. (This remark
is also made on p. 83-84 of [Ba].) However, since state spaces are usually infinite
dimensional, we would like to work with something similar to dim which does not
assume infinite values.

This leads us naturally to the C*-algebras known as finite von Neumann algebras
(see for example [KR2]), since for such an algebra there is a dimension function,
defined on the projections of the algebra, which does not assume infinite values. This
function is in fact the restriction of a so-called trace defined on the whole algebra,
so we might as well work with this trace. We now explain this in more detail.

Let $\mathfrak{M}$ denote a finite von Neumann algebra on a Hilbert space $\mathfrak{H}$, and let $\mathfrak{M}'$
be its commutant. Then there is a unique positive linear mapping $\mathrm{tr} : \mathfrak{M} \to \mathfrak{M} \cap \mathfrak{M}'$
such that $\mathrm{tr}(AB) = \mathrm{tr}(BA)$ and $\mathrm{tr}(C) = C$ for all $A, B \in \mathfrak{M}$ and $C \in \mathfrak{M} \cap \mathfrak{M}'$. We
call tr the trace of $\mathfrak{M}$. This trace is faithful, that is to say $\mathrm{tr}(A^* A) > 0$ for $A \neq 0$.
(Conversely, if such a faithful trace exists on a von Neumann algebra $\mathfrak{N}$, then $\mathfrak{N}$ is
finite [KR2, Section 8.1], and hence this could be taken as the definition of a finite
von Neumann algebra.) We mention that in the special case where $\mathfrak{M} = \mathfrak{L}(\mathfrak{H})$, with
$\mathfrak{H}$ finite dimensional, tr is just the usual trace (sum of eigenvalues) normalized such
that $\mathrm{tr}(1) = 1$.
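
For the finite dimensional special case just mentioned, the defining properties of the
normalized trace can be verified directly. The sketch below does this for 2 x 2 complex
matrices in plain Python; the particular matrices A and B and the helper names are
arbitrary choices made for the illustration.

```python
def matmul(A, B):
    """Product of two square matrices given as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def dagger(A):
    """Adjoint (conjugate transpose) of a square matrix."""
    n = len(A)
    return [[A[j][i].conjugate() for j in range(n)] for i in range(n)]


def ntr(A):
    """Normalized trace: the usual trace divided by the dimension."""
    n = len(A)
    return sum(A[i][i] for i in range(n)) / n


IDENTITY = [[1 + 0j, 0j], [0j, 1 + 0j]]
A = [[1 + 2j, 0.5 + 0j], [-1j, 3 + 0j]]
B = [[0j, 1j], [2 + 0j, 1 - 1j]]

print(ntr(IDENTITY))                          # normalization tr(1) = 1
print(ntr(matmul(A, B)), ntr(matmul(B, A)))   # tracial property tr(AB) = tr(BA)
print(ntr(matmul(dagger(A), A)).real)         # faithfulness: positive since A != 0
```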

For a projection $P \in \mathfrak{M}$ of $\mathfrak{H}$ onto the Hilbert subspace $\mathfrak{K}$, we see that $U_t^* P U_t$
is the projection of $\mathfrak{H}$ onto $U_t^* \mathfrak{K}$, where $U_t$ is a one-parameter unitary group on $\mathfrak{H}$.
So in the framework of finite von Neumann algebras we would like to replace the
equation $\dim(U_t^* \mathfrak{K}) = \dim(\mathfrak{K})$ mentioned above by $\mathrm{tr}(U_t^* P U_t) = \mathrm{tr}(P)$ as a quantum
analogue of Liouville's Theorem.

If a self-adjoint (possibly unbounded) operator A in $\mathfrak{H}$ is an observable and $\mathfrak{M}$
an observable algebra of a physical system, then we want the spectral projections
$\chi_V(A)$ of A to be contained in $\mathfrak{M}$, where V is any Borel set in $\mathbb{R}$, since these
projections are the projections of the yes/no experiments that can be performed on
the system. But then $f(A) \in \mathfrak{M}$ for any bounded complex-valued Borel function
$f$ on $\mathbb{R}$. (Our argument here is roughly that there is a bounded sequence of bounded
simple functions $s_n$ converging pointwise to $f$, which implies that $s_n(A)x \to f(A)x$
for all $x \in \mathfrak{H}$, i.e. $s_n(A)$ converges strongly to $f(A)$. Since a von Neumann algebra
is strongly closed, it follows that $f(A) \in \mathfrak{M}$. See [SZ, 9.10, 9.11 and 9.32].) In
particular $e^{-iAt} \in \mathfrak{M}$ for all real t.

For these reasons we will consider physical systems of the following nature: 

1.7.3 Definition. A bounded quantum system is a quantum mechanical system
for which we can take the observable algebra as a finite von Neumann algebra $\mathfrak{M}$ on
a Hilbert space $\mathfrak{H}$ such that the Hamiltonian H of the system can be represented as
a self-adjoint (possibly unbounded) linear operator in $\mathfrak{H}$ with $e^{-iHt} \in \mathfrak{M}$ for real t.
We denote this system by $(\mathfrak{M}, \mathfrak{H}, H)$.

The reason for the term "bounded" will become clear in Section 1.9. 

1.7.4 Remark. If for a bounded quantum system $(\mathfrak{M}, \mathfrak{H}, H)$ the unit vectors
$x \in \mathfrak{H}$ are pure states of the system, that is to say $\langle x, \cdot\, x \rangle$ is a pure state on $\mathfrak{M}$ for
such $x$, then $\mathfrak{H}$ can be viewed as the state space of the system (this happens for
example when $\mathfrak{M} = \mathfrak{L}(\mathfrak{H})$ with $\mathfrak{H}$ finite dimensional). However, the unit elements of
$\mathfrak{H}$ need not be pure states of the system, as we will now show, in which case $\mathfrak{H}$ is
not the state space of the system, but merely acts as a "carrier" for the observable
algebra $\mathfrak{M}$.

Let $\mathfrak{G}$ be a finite dimensional Hilbert space, and consider a mixed (i.e. not pure)
faithful normal state $\omega$ on $\mathfrak{L}(\mathfrak{G})$, for example a Gibbs state $\omega(A) = \mathrm{Tr}(\rho A)$ where
$\rho = e^{-\beta G} / \mathrm{Tr}(e^{-\beta G})$ with $G \in \mathfrak{L}(\mathfrak{G})$ the Hamiltonian of some system with state
space $\mathfrak{G}$, and $\beta$ the inverse temperature of the system (see [Dl, Proposition 2.3.9]
for example). Here normal refers to the form $\mathrm{Tr}(\rho\, \cdot)$ of the state, where $\rho$ is a density
operator, while faithful means that $\omega(A^* A) > 0$ if $A \neq 0$.

Let $(\mathfrak{H}, \pi, \Omega)$ be a cyclic representation of $(\mathfrak{L}(\mathfrak{G}), \omega)$ as in Section 1.2. Let $\mathfrak{M} :=
\pi(\mathfrak{L}(\mathfrak{G}))$ and $H := \pi(G)$, then we prove that $(\mathfrak{M}, \mathfrak{H}, H)$ is a bounded quantum
system.

First, $\mathfrak{M}$ is a von Neumann algebra, since $\mathfrak{L}(\mathfrak{G})$ is a von Neumann algebra and
$\omega$ is normal [BR, Theorem 2.4.24]. Furthermore, $\pi$ is a *-isomorphism since $\omega$ is
faithful [BR, Proposition 2.5.6]. (Also see [Dl, Proposition 4.4.9], for the same
results.) It is known that $\mathfrak{L}(\mathfrak{G})' = \mathbb{C}$ (see [Dl, Proposition 1.4.7]), and since $\mathfrak{M}$ is
*-isomorphic to $\mathfrak{L}(\mathfrak{G})$, this means that the elements of $\mathfrak{M}$ which commute with $\mathfrak{M}$
are also just the multiples of unity, that is to say $\mathfrak{M} \cap \mathfrak{M}' = \mathbb{C}$. Since $\pi$ is injective
and $\pi(1) = 1$, we can therefore define a trace $\mathfrak{M} \to \mathfrak{M} \cap \mathfrak{M}'$ (in the sense described
above) by $\mathrm{tr}(\pi(A)) := \mathrm{tr}(A)$, where tr on the right is the (normalized) trace of $\mathfrak{L}(\mathfrak{G})$.
This trace is faithful on $\mathfrak{M}$ since the trace on $\mathfrak{L}(\mathfrak{G})$ is faithful. Hence $\mathfrak{M}$ is finite (see
above). Since $\pi$ is a *-homomorphism from a Banach *-algebra to a C*-algebra, it
is continuous [Mu, Theorem 2.1.7]. Hence

$$e^{-iHt} = e^{-i\pi(G)t} = \pi(e^{-iGt}) \in \mathfrak{M}.$$

This proves that $(\mathfrak{M}, \mathfrak{H}, H)$ is a bounded quantum system. (As an example of the
situation in Proposition 1.7.5 below, note that $e^{-iHt}$ gives the time-evolution of the
system in terms of $\mathfrak{M}$ rather than $\mathfrak{L}(\mathfrak{G})$, namely

$$\pi(e^{iGt} A e^{-iGt}) = e^{iHt} \pi(A) e^{-iHt}$$

for $A \in \mathfrak{L}(\mathfrak{G})$.)

However, the state $\langle \Omega, \cdot\, \Omega \rangle = \omega \circ \pi^{-1}$ is not pure on $\mathfrak{M}$, since $\omega$ is not pure (see
[BR, Definition 2.3.14] for the formal mathematical definition of a pure state on a
C*-algebra). In other words $\Omega$ is not a pure state of the system, and therefore $\mathfrak{H}$ is
not the state space of the system. ■

We now propose a quantum analogue of Liouville's Theorem based on the
intuitive arguments in terms of dimension given above. We give it in the form of a
proposition (its proof is easy; the work went into finding a sensible candidate for
such an analogue):

1.7.5 Proposition. Consider a bounded quantum system $(\mathfrak{M}, \mathfrak{H}, H)$. By Stone's
Theorem $U_t = e^{-iHt}$ is a one-parameter unitary group on $\mathfrak{H}$. Let $\tau$ be the
time-evolution of the system, i.e. $\tau_t(A) = U_t^* A U_t$ for all $A \in \mathfrak{M}$. Then

$$\mathrm{tr}(\tau_t(A)) = \mathrm{tr}(A) \qquad (7.2)$$

for all A in $\mathfrak{M}$, where tr is the trace of $\mathfrak{M}$. (This last statement is our quantum
analogue of Liouville's theorem.)

Proof. Since $U_t \in \mathfrak{M}$, we have $\mathrm{tr}(\tau_t(A)) = \mathrm{tr}(U_t^* A U_t) = \mathrm{tr}(U_t U_t^* A) = \mathrm{tr}(A)$. ■
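
Equation (7.2) is easy to test numerically in the finite dimensional case. The sketch
below assumes, purely for illustration, a two-level system whose Hamiltonian is
diagonal with energies 0.5 and 2.0, so that the unitary group is a diagonal matrix of
phases; these values, the observable A and the helper names are not from the text.

```python
import cmath


def matmul(A, B):
    """Product of two square matrices given as nested lists."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def dagger(A):
    """Adjoint (conjugate transpose) of a square matrix."""
    n = len(A)
    return [[A[j][i].conjugate() for j in range(n)] for i in range(n)]


def ntr(A):
    """Normalized trace, so that the identity has trace 1."""
    n = len(A)
    return sum(A[i][i] for i in range(n)) / n


def evolution_unitary(energies, t):
    """U_t = exp(-iHt) for the diagonal Hamiltonian H = diag(energies)."""
    n = len(energies)
    return [[cmath.exp(-1j * energies[i] * t) if i == j else 0j for j in range(n)]
            for i in range(n)]


def tau(A, energies, t):
    """Time-evolution tau_t(A) = U_t^* A U_t on the observable algebra."""
    U = evolution_unitary(energies, t)
    return matmul(matmul(dagger(U), A), U)


energies = [0.5, 2.0]                       # assumed energy levels
A = [[1 + 0j, 2 - 1j], [2 + 1j, -3 + 0j]]   # an arbitrary observable
A_t = tau(A, energies, 1.7)

print(ntr(A), ntr(A_t))       # the traces agree
print(A[0][1], A_t[0][1])     # while the observable itself has changed
```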

As we suspected, our quantum analogue of Liouville's theorem, expressed by
(7.2), is of the same form as the C*-algebraic formulation of the classical Liouville
Theorem as given by (7.1), with $\varphi$ replaced by tr. Remember that $\varphi$ and tr are both
positive linear mappings on the respective observable algebras.

A somewhat different approach to a quantum analogue of Liouville's Theorem is 
described in [AM]. 



1.7.6 Remark. The classical Liouville Theorem can also be expressed in terms of
the Liouville equation

$$\frac{\partial \rho}{\partial t} = \{H, \rho\}$$

where $\rho : \mathbb{R}^{2n} \times \mathbb{R} \to \mathbb{R}$ is the density function, H the classical Hamiltonian, and
$\{\cdot, \cdot\}$ the Poisson bracket. This equation can be seen as describing the flow of a fluid
in phase space such that at any point moving along with the fluid, the density of
the fluid remains constant. So besides giving the time-evolution, this equation also
states a property of the time-evolution, namely that it conserves volume in phase
space. In quantum mechanics we have the analogous von Neumann equation

$$i\,\frac{d\rho}{dt} = [H, \rho]$$

where $\rho : \mathbb{R} \to \mathfrak{L}(\mathfrak{H})$ is the density operator as a function of time (note that here
the derivative with respect to time is total instead of partial). This equation merely
gives the time-evolution $\rho(t) = \tau_{-t}(\rho(0))$ of the density operator, where $\tau$ is the
time-evolution on the observable algebra here viewed as acting on the state instead of the
observables. Von Neumann's equation by itself should therefore not be regarded as
a quantum mechanical analogue of Liouville's Theorem. ■

1.8 The state of no information

In (b) of Section 1.4, we said that the state of a system is constructed from in- 
formation gained during measurements previously performed on the system. If the 
observer hasn't performed any measurements on the system, then he has no in- 
formation regarding the system (however, the observable algebra is assumed to be 
known, i.e. the observer knows what the system is). Can we describe this situation 
by a state on the observable algebra of the system? It turns out that we can in the 
framework of Section 1.7 (namely for bounded quantum systems and for classical
systems with phase space $F \subset \mathbb{R}^{2n}$ of finite volume). Such a state on the observable
algebra can then be called a state of no information.

1.8.1 Classical mechanics. Let's first consider a classical system. Assume that
its phase point is confined to a (Borel) set F of finite volume in the phase space $\mathbb{R}^{2n}$,
i.e. $\lambda(F) < \infty$. (So we can view F as the system's phase space; see Remark 1.7.2.)
We now argue that practical matters force us to assume $\lambda(F) > 0$: In practice
it is impossible to measure any of the position or momentum coordinates of the
system precisely, so it is safe to assume that each of these coordinates can at best be
determined only up to some interval of positive length, and hence F must contain
the product of these intervals, which implies $\lambda(F) > 0$. If F did not contain this
product, it would not make sense for us to use F as the phase space of the system,
since we would not even know if the system's phase point is contained in F.

We can therefore normalize $\lambda$ on F by defining a probability measure $\lambda'$ on the
Borel sets of $\mathbb{R}^{2n}$ by

$$\lambda'(S) = \lambda(S \cap F) / \lambda(F).$$

If we now view $\lambda'$ as describing a state of the system (as explained in Section 1.3),
then it essentially says that every part of F is equally likely to contain the phase
point of the system. Mathematically this boils down to the fact that the Lebesgue
measure $\lambda$ is translation invariant, which means that it is the same everywhere, so
$\lambda'$ can be viewed as a uniform probability distribution. In other words, when the
observer knows nothing about where the phase point of the system is (aside from
the fact that it is in F), then we can describe the observer's information by $\lambda'$, or
in C*-algebraic terms by the state $\varphi$ on $B_\infty(\mathbb{R}^{2n})$ defined by

$$\varphi(g) = \int g \, d\lambda'.$$

Since Lebesgue measure is the unique (up to some normalization factor) translation
invariant Borel measure on $\mathbb{R}^{2n}$ assuming finite values on compact sets (which are
bounded and therefore should have finite volumes), we can view $\varphi$ as the state of
no information. (Refer to [Rud] for an exposition of the properties of the Lebesgue
measure.)

For this state of no information to make sense, it has to be compatible with
the time-evolution of the system in the following sense: If the observer has no
information regarding the system at time 0, and he performs no measurements on
the system up to some later time t, then at time t he still has no information
regarding the system. This means that if we apply the time-evolution $\tau$ of the
system to the state $\varphi$ instead of to the observable algebra, to obtain the state $\varphi \circ \tau_t$
at time t, then this state still has to represent the state of no information. That is
to say, we must have $\varphi \circ \tau_t = \varphi$. But this is exactly what Liouville's Theorem states
(see equation (7.1)). So we see that Liouville's Theorem is intimately related to the
idea of information, in the sense that it ensures that the state of no information is
compatible with the system's time-evolution. We can say that Liouville's Theorem
makes the state of no information dynamically sensible. We can also view this as
a special case of a group invariance defining a probability distribution, in this case
invariance under time-evolution defining the state of no information (see [J] for more
on this idea).

1.8.2 Quantum mechanics. Now we turn to a bounded quantum system as
defined in 1.7.3, namely $(\mathfrak{M}, \mathfrak{H}, H)$ where we assume that $\mathfrak{M}$ is a factor (that is to
say $\mathfrak{M} \cap \mathfrak{M}' = \mathbb{C}1$), which means that we can take tr to be complex-valued. (In
general we will refer to a finite von Neumann algebra which is a factor, as a finite
factor.) The reason for assuming $\mathfrak{M}$ to be a factor is that tr is then a state on $\mathfrak{M}$,
since we know that tr is positive and normalized. This means that tr can in principle
represent a physical state as described in Section 1.2.

In Section 1.7 we saw that tr can be viewed as a quantum analogue of integration
over a bounded set in phase space with respect to Lebesgue measure $\lambda$, in other
words, as a quantum analogue of $\varphi$ in 1.8.1. The basic intuition here is that our
quantum analogue of Liouville's Theorem is expressed in terms of tr in precisely the
same form as that in which Liouville's Theorem is expressed in terms of $\varphi$, namely
$\mathrm{tr}(\tau_t(A)) = \mathrm{tr}(A)$ as compared to $\varphi(\tau_t(g)) = \varphi(g)$. By this analogy between tr and $\varphi$
we would expect tr to be the state of the bounded quantum system when the observer
knows nothing about the system, in other words that tr is a state of no information.
This is indeed true in the special case where $\mathfrak{H}$ is finite dimensional and $\mathfrak{M} = \mathfrak{L}(\mathfrak{H})$,
since for any rank one projection Q in $\mathfrak{M}$ we then have $\mathrm{tr}(Q) = 1/\dim(\mathfrak{H})$ which
tells us that if the state is tr, then all eigenvalues are equally probable when an
observable is measured (assuming the observable has no degenerate eigenvalues).
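
The statement that all outcomes are equally probable in the state tr can be checked in
a small example. The sketch below takes a three-dimensional Hilbert space and, as an
assumption made only for the illustration, uses the discrete Fourier basis as the
eigenbasis of some observable with nondegenerate eigenvalues; the probability of each
outcome is the normalized trace of the corresponding rank one projection.

```python
import cmath
import math


def rank_one_projection(v):
    """Projection Q = |v><v| onto the span of a unit vector v."""
    n = len(v)
    return [[v[i] * v[j].conjugate() for j in range(n)] for i in range(n)]


def ntr(A):
    """Normalized trace, so that the identity has trace 1."""
    n = len(A)
    return sum(A[i][i] for i in range(n)) / n


def fourier_basis(n):
    """An orthonormal basis of C^n, assumed to be the eigenbasis of an observable."""
    return [[cmath.exp(2j * math.pi * j * k / n) / math.sqrt(n) for j in range(n)]
            for k in range(n)]


# Probability of each measurement outcome when the state is the trace.
probabilities = [ntr(rank_one_projection(v)).real for v in fourier_basis(3)]
print(probabilities)
```

Each probability equals one third, independently of which orthonormal basis is used,
since the normalized trace of any rank one projection is one over the dimension.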

As mentioned in Section 1.7, tr is the unique state on $\mathfrak{M}$ such that $\mathrm{tr}(AB) =
\mathrm{tr}(BA)$ for all $A, B \in \mathfrak{M}$, but this is in fact equivalent to the condition that
$\mathrm{tr}(U^* P U) = \mathrm{tr}(P)$ for all unitary $U \in \mathfrak{M}$ and all projections $P \in \mathfrak{M}$ (see [KR2,
Proposition 8.1.1 and its proof]). We can view unitary operators as rotations in
the state space of the quantum system, so $\mathrm{tr}(AB) = \mathrm{tr}(BA)$ tells us that rotations
of the state space preserve the "size" of Hilbert subspaces (which correspond to
projections), where "size" here refers to the dimension function on the projections
of $\mathfrak{M}$, mentioned in Section 1.7. This is the quantum mechanical equivalent of the
classical situation where translations preserve Lebesgue measure, since as described
in Section 1.7, the dimension of Hilbert subspaces of the state space should
correspond to Lebesgue measure as a measure of the size of Borel sets (which correspond
to projections in the classical case). In the same way as in the classical case in
1.8.1, we can therefore view tr as the state of no information of a bounded quantum
system.

As explained in 1.8.1, Liouville's Theorem is central in the concept of a state of no 
information, since it makes such a state dynamically sensible. The same argument 
applies to our quantum analogue of Liouville's Theorem (Proposition 1.7.5) to see 
that it ensures that the state of no information tr is compatible with the system's 
time evolution, namely $\mathrm{tr} \circ \tau_t = \mathrm{tr}$.

Furthermore, since tr is ultraweakly continuous, it is a normal state and hence it 
is given by a density operator (see [KR2, Theorem 8.2.8, Proposition 7.4.5, Theorem 
7.1.12] and [BR, Theorem 2.4.21]), as one might expect for a physically meaningful 
state (keep in mind, however, that this density operator is defined on $\mathfrak{H}$, which is not
necessarily the state space of the system; see Remark 1.7.4). We therefore suggest 
the following hypothesis: 

1.8.3 Postulate. Consider a bounded quantum system $(\mathfrak{M}, \mathfrak{H}, H)$, where $\mathfrak{M}$ is a
factor. If the observer has no information regarding the system, then the state of
the system is given by the trace tr of $\mathfrak{M}$.

1.9 Bounded quantum systems 

In this section we discuss the possible physical significance of bounded quantum 
systems, using the analogy with classical systems built up in Sections 1.7 and 1.8. 
What we want to know is which physical systems can be mathematically described 
as bounded quantum systems with the observable algebras being factors, since this 
is the type of system considered in Postulate 1.8.3. 

In Sections 1.7 and 1.8 we considered the case of a classical system whose phase
point is confined to a set F of finite volume, which meant that we could view F as the
phase space of the system. A special case of this is where the phase space is bounded
(i.e. contained in some ball in $\mathbb{R}^{2n}$). Bounded sets are indeed less general than sets of
finite volume, as witnessed by the set $F = \{(x, y) \in \mathbb{R}^2 : 0 \le y \le e^{-x},\ 0 \le x < \infty\}$
which is an unbounded closed (and hence Borel) set which has a part of positive
measure lying outside any ball in $\mathbb{R}^2$ (we might call this set Lebesgue unbounded,
since the part that goes to infinity does not have zero Lebesgue measure), but
even so F has a finite Lebesgue measure of 1. (We will not pursue the question of
whether a Lebesgue unbounded phase space of finite volume actually occurs in any
physical system, since our arguments here will be heuristic and based on the idea of
boundedness.)
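
The two claims about this example can be verified by a direct computation, reading F
as the region between the x-axis and the graph of $e^{-x}$ for $x \ge 0$. The radius R below
is an auxiliary symbol introduced only for this check.

```latex
\lambda(F) = \int_0^\infty e^{-x}\,dx = 1,
\qquad
\lambda\bigl(F \cap \{(x,y) : x > R\}\bigr) = \int_R^\infty e^{-x}\,dx = e^{-R} > 0
\quad \text{for every } R > 0.
```

Every ball in the plane is contained in a ball of some radius R about the origin, and
all points with x > R lie outside the latter, so a part of F with measure at least
$e^{-R}$ lies outside any given ball.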

From a physical standpoint the phase space is bounded if the system itself is
confined to a finite volume in space, and it is isolated from outside influences (which
could increase its energy content), to prevent any of its momentum components from
going to infinity. To see that this is the case, use Cartesian coordinates. Here we
assume that each potential of the form $-1/r$ or the like has some "cut-off" at small
values of r, since for example particles are of finite size and collide when they get too
close. The point of this is that there is not an infinite amount of potential energy
available in the system (potentials do not go to $-\infty$). Consider as illustration a
potential with the general shape given by $-1/r + 0.0015/r^2$, where the $0.0015/r^2$
term causes the cut-off, that is to say for "large" r the potential looks like $-1/r$,
but as $r > 0$ decreases, the potential deviates from $-1/r$, reaches a minimum, and
then goes to $+\infty$.

Based on the analogy between bounded quantum systems and classical systems 
with bounded phase space presented in Sections 1.7 and 1.8, we might now guess 
that quantum systems bounded in space and isolated from outside influences can 
be described as bounded quantum systems in the sense of Definition 1.7.3 with $\mathfrak{M}$
a factor.

Of course, the analogy actually extends to the more general case of classical sys- 
tems with phase space of finite volume, but since we have no hard evidence apart 
from this analogy, it is probably best not to push it to its limits. (We will find 
some additional indirect evidence supporting our guess when we discuss recurrence 
for quantum systems in Section 3.1.) Also, it is not exactly clear how the idea of
a finite volume of phase space should be translated to quantum mechanics. Possibly
one could approach this problem by considering a quantum system which is a
quantization of a classical system whose phase space has finite volume. However, the
argument by analogy that this system too is a bounded quantum system is becoming
more and more tenuous. This seems to be related to the nuclearity requirement
in quantum field theory (see [Ha]), where a finite volume in classical phase space is
intuitively thought of as corresponding to a finite dimensional subspace of quantum
state space. Since a quantum system whose state space $\mathfrak{H}$ is finite dimensional is
clearly a bounded quantum system (the observable algebra $\mathfrak{L}(\mathfrak{H})$ is a finite factor in
this case), our guess certainly does not seem too far-fetched from this point of view.

We state our guesswork as a conjecture: 

1.9.1 Conjecture. A quantum mechanical system bounded in space, and isolated
from outside influences, can be mathematically described as a bounded quantum
system in the sense of Definition 1.7.3, with the observable algebra $\mathfrak{M}$ a factor.

1.9.2 Remark. A bounded quantum system $(\mathfrak{M}, \mathfrak{H}, H)$ as defined in 1.7.3, with
$\mathfrak{M}$ a factor, deviates from the usual "type I" quantum mechanics (see [Ha, Section
VII.2]), in that the former does not necessarily have "finest" yes/no experiments.
This refers to the fact that the range of the dimension function (on the projections
of $\mathfrak{M}$) can be the whole interval [0, 1] which has no minimum non-zero value, in
which case $\mathfrak{M}$ is called a type II$_1$ factor. It should be noted though, that a bounded
quantum system always has pure states (states of maximal information), since any
non-zero C*-algebra (and in particular a finite factor) has pure states (see [Mu,
Theorem 5.1.11]), as is physically required, since nonmaximal information is a result
of the observer's lack of precision rather than a property of the system. Loosely this
means that although an observer can always do a finer measurement than the ones
he already did, such a measurement will not necessarily improve his information, it
might simply give new information invalidating his old information (noncommuting
observables), but giving a "smaller" subspace in the state space, not contained in
the subspace corresponding to his old information, since $\dim(P_2) \le \dim(P_1)$ does
not imply $P_2 \le P_1$ in the C*-algebraic partial order. (Keep in mind that $\mathfrak{H}$ is not
necessarily the state space, it just acts as a "carrier" for $\mathfrak{M}$; see Remark 1.7.4.)

In type I quantum mechanics the observable algebra is simply taken as the type
I factor $\mathfrak{L}(\mathfrak{H})$, where a separable Hilbert space $\mathfrak{H}$ is the state space of the system.
The dimension function on the projections of $\mathfrak{L}(\mathfrak{H})$ is simply the dimension of the
range of a projection, and hence it has the minimum non-zero value 1; see [Co, p.
455] for example. The projections with dimension one represent the finest yes/no
experiments that can be performed on the system. ■

1.9.3 Example. A one-dimensional quantum harmonic oscillator has a discrete
unbounded energy spectrum consisting of equally spaced values

$$E_n = (2n + 1)E_0$$

for $n = 0, 1, 2, \dots$ where $E_0 > 0$ is the lowest energy value (see [CDL, Section V.B] or
[Kre, Example 11.3-1]). In the state of no information each of these energy values
should be equally likely, but that would mean that all of them have probability zero,
which doesn't make physical sense, since if the oscillator's energy is measured, some
value must be obtained, and so this value does not have zero probability. Therefore
the state of no information does not exist as a state on the observable algebra in
this case, which means that the oscillator is not a bounded quantum system. This
makes sense, since the energy eigenstate in $L^2(\mathbb{R})$ corresponding to $E_n$ is a "Gaussian
tapered" Hermite polynomial of the form

(where $x$ is the position, and $\gamma$ a constant deriving from the physical properties of
the oscillator, namely mass and frequency), which has a steadily increasing
non-negligible spatial extension as $n$ increases, corresponding to the classical situation
where the amplitude in space increases as the energy increases ([CDL, Section
V.C.2] or [Kre, Example 11.3-1]). So if all the energy values are allowed, then the
system is not bounded in space.

An approximate description of a quantum harmonic oscillator bounded in space
as a bounded quantum system could be to take the state space $\mathfrak{H}$ as the finite
dimensional subspace of $L^2(\mathbb{R})$ spanned by the energy eigenstates corresponding to
$E_0, \dots, E_N$ for some $N$, and then to use the finite factor $\mathfrak{L}(\mathfrak{H})$ as the observable
algebra. However, a careful analysis from the ground up would be necessary to see if
an isolated quantum harmonic oscillator bounded in space is indeed a bounded quantum
system. ■
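
The counting argument in this example can be illustrated with a small sketch. It assumes (this is not spelled out in the example itself) that the state of no information on the truncated oscillator with levels $E_0, \dots, E_N$ is the normalized trace, so that every one-dimensional energy projection receives weight $1/(N+1)$. The function name `level_probabilities` is hypothetical.

```python
from fractions import Fraction


def level_probabilities(N):
    """Probabilities of E_0, ..., E_N under the normalized trace Tr/(N + 1).

    Assumption: the state of no information on the (N + 1)-dimensional
    truncation is the normalized trace, and the projection onto a single
    energy eigenstate has trace 1.
    """
    dim = N + 1
    return [Fraction(1, dim)] * dim


for N in (1, 9, 99, 999):
    probs = level_probabilities(N)
    # Every level is equally likely and the probabilities sum to one.
    print(N, probs[0], sum(probs))
```

For every finite $N$ the weights are positive and sum to one, while each individual weight tends to zero as $N$ grows, which is the obstruction described above for the untruncated oscillator.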



Chapter 2 



Recurrence and ergodicity in 
*-algebras 

In this chapter (based on [DS]), results concerning recurrence and ergodicity are
proved in an abstract Hilbert space setting based on the proof of Khintchine's
recurrence theorem for sets, and on the Hilbert space characterization of ergodicity.
These results are carried over to a noncommutative *-algebraic setting using the
GNS-construction. This generalizes the corresponding measure theoretic results, in
particular a variation of Khintchine's Theorem for ergodic systems, where the image
of one set overlaps with another set, instead of with itself.

2.1 Introduction 

The inspiration for this chapter is the following theorem of Khintchine dating from 
1934 (see [Pete] for a proof): 

2.1.1 Khintchine's Theorem. Let $(X, \Sigma, \mu)$ be a probability space (that is to say,
$\mu$ is a measure on a $\sigma$-algebra $\Sigma$ of subsets of a set $X$, with $\mu(X) = 1$), and consider
a mapping $T : X \to X$ such that $T^{-1}(S) \in \Sigma$ and $\mu(T^{-1}(S)) \le \mu(S)$ for all $S \in \Sigma$.
Then for any $A \in \Sigma$ and $\varepsilon > 0$, the set

$$E = \{k \in \mathbb{N} : \mu(A \cap T^{-k}(A)) > \mu(A)^2 - \varepsilon\}$$

is relatively dense in $\mathbb{N} = \{1, 2, 3, \dots\}$.

We will call $(X, \Sigma, \mu, T)$, as given above, a measure theoretic dynamical system.
Recall that the relative denseness of $E$ in $\mathbb{N}$ means that there exists an $n \in \mathbb{N}$
such that $E \cap \{j, j+1, \dots, j+n-1\}$ is non-empty for every $j \in \mathbb{N}$. Khintchine's
Theorem is an example of a recurrence result. It tells us that for every $k \in E$, the
set $A$ contains a set $A \cap T^{-k}(A)$ of measure larger than $\mu(A)^2 - \varepsilon$ which is mapped
back into $A$ by $T^k$.
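
As a concrete illustration of relative denseness, the following sketch computes the recurrence set for a toy system that is not taken from the text: a rotation of twelve points carrying the uniform probability measure. The names `recurrence_set` and `window_length` are hypothetical helpers, and the search is cut off at a finite `k_max`.

```python
from fractions import Fraction


def recurrence_set(n_points, A, eps, k_max):
    """Return the k <= k_max with mu(A & T^{-k}(A)) > mu(A)^2 - eps.

    Toy assumption: X = {0, ..., n_points - 1} with the uniform probability
    measure, and T(x) = (x + 1) mod n_points.
    """
    A = set(A)
    mu_A = Fraction(len(A), n_points)
    E = []
    for k in range(1, k_max + 1):
        # T^{-k}(A) consists of the points that T^k maps into A.
        preimage = {x for x in range(n_points) if (x + k) % n_points in A}
        overlap = Fraction(len(A & preimage), n_points)
        if overlap > mu_A**2 - eps:
            E.append(k)
    return E


def window_length(E, k_max):
    """Smallest n such that every window of n consecutive k in [1, k_max] meets E."""
    hits = set(E)
    longest_miss = run = 0
    for k in range(1, k_max + 1):
        run = 0 if k in hits else run + 1
        longest_miss = max(longest_miss, run)
    return longest_miss + 1


E = recurrence_set(12, {0, 1, 2}, Fraction(1, 100), 48)
print(E, window_length(E, 48))
```

In this toy system the returned window length plays the role of the $n$ in the definition of relative denseness, restricted to the finite range that was searched.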

A question that arises from Khintchine's Theorem is whether, given $A, B \in \Sigma$
and $\varepsilon > 0$, the set

$$F = \{k \in \mathbb{N} : \mu(A \cap T^{-k}(B)) > \mu(A)\mu(B) - \varepsilon\}$$

is relatively dense in $\mathbb{N}$. This is clearly not true in general, for example if $T$ is the
identity and $A$, $B$ and $\varepsilon$ are chosen such that $\mu(A)\mu(B) > \varepsilon$ while $A \cap B$ is empty,
then $F$ is empty. $T$ has to "mix" the measure space sufficiently for $F$ to be
non-empty. In [Wa] it is shown for the case where $\mu(T^{-1}(S)) = \mu(S)$ for all $S \in \Sigma$, that
if for every pair $A, B \in \Sigma$ of positive measure there exists some $k \in \mathbb{N}$ such that
$\mu(A \cap T^{-k}(B)) > 0$, then the dynamical system is ergodic. Ergodicity therefore
seems like the natural concept to use when considering the question posed above.
This is indeed what we will do.

The notion of ergodicity originally developed as a way to characterize systems
in classical statistical mechanics for which the time mean and the phase space mean
of any observable are equal. For our purposes it will be most convenient to define
ergodicity of a measure theoretic dynamical system $(X, \Sigma, \mu, T)$ as follows (refer to
[Pete], for example): $(X, \Sigma, \mu, T)$ is called ergodic if the fixed points of the linear
Hilbert space operator $U : L^2(\mu) \to L^2(\mu) : f \mapsto f \circ T$ form a one-dimensional
subspace of $L^2(\mu)$. Keep in mind that $L^2(\mu)$ consists of equivalence classes of
functions, with two functions equivalent if they are equal almost everywhere, but it is
easy to see that $U$ is well-defined on $L^2(\mu)$, that is to say, if $f$ and $g$ are measurable
functions equal almost everywhere, then $f \circ T$ and $g \circ T$ are equal almost everywhere.
Also, for $f \in L^2(\mu)$ we have

$$\int |f \circ T|^2 \, d\mu = \int |f|^2 \, d(\mu \circ T^{-1}) \le \int |f|^2 \, d\mu$$

and so $f \circ T \in L^2(\mu)$. Furthermore this inequality says that $\|U\| \le 1$. Here $\mu \circ T^{-1}$
is the measure on $\Sigma$ defined by $(\mu \circ T^{-1})(S) := \mu(T^{-1}(S)) \le \mu(S)$.

As we shall see, the ideas we have discussed so far are not really measure theoretic 
in nature. This is in large part due to the fact that the proof of Khintchine's Theorem 
is essentially a Hilbert space proof using the Mean Ergodic Theorem. This proof can 
for the most part be written purely in Hilbert space terms, hence giving an abstract 
Hilbert space result. Along with the Hilbert space characterization of ergodicity 
given above, this means that a fair amount of ergodic theory can be done purely in 
an abstract Hilbert space setting. This is the approach taken in Section 2.4, using 
the Mean Ergodic Theorem as the basic tool. 

Having built up some ergodic theory in abstract Hilbert spaces, nothing is to
stop us from applying the results to mathematical structures other than measure
theoretic dynamical systems. The mathematical structure we will consider is much
more general than measure theoretic dynamical systems and can easily be motivated
as follows: From a measure theoretic dynamical system $(X, \Sigma, \mu, T)$ we obtain the
unital *-algebra $B_\infty(\Sigma)$ of all bounded complex-valued measurable functions defined
on $X$, and two linear mappings

$$\varphi : B_\infty(\Sigma) \to \mathbb{C} : f \mapsto \int f \, d\mu$$

and

$$\tau : B_\infty(\Sigma) \to B_\infty(\Sigma) : f \mapsto f \circ T \tag{1.1}$$

with the following properties: $\varphi(1) = 1$, $\varphi(f^*f) \ge 0$, $\tau(1) = 1$ and $\varphi(\tau(f)^*\tau(f)) \le
\varphi(f^*f)$ for all $f \in B_\infty(\Sigma)$ by Proposition 1.7.1, where $f^* = \overline{f}$ defines the involution
on $B_\infty(\Sigma)$, making it a *-algebra. We can view this abstractly by replacing $B_\infty(\Sigma)$
with any unital *-algebra and considering linear mappings $\varphi$ and $\tau$ on it with the
properties mentioned above. (A unital *-algebra $\mathfrak{A}$ is an algebra with an involution,
and a unit element denoted by 1, that is to say $1A = A = A1$ for all $A \in \mathfrak{A}$. We
will only work with the case of complex scalars.) The most obvious generalization
this brings is that the unital *-algebra need not be commutative, for example the
bounded linear operators on a Hilbert space. Also note that $\tau$ in (1.1) is a
*-homomorphism of $B_\infty(\Sigma)$, but we will not need this property of $\tau$ in the abstract
*-algebraic setting. We describe the *-algebraic setting in more detail in Section 2.3,
and in Section 2.5 the Hilbert space results are applied to this setting using the
GNS-construction (treated in Section 2.2). In Section 2.6 we obtain the measure theoretic
results as a special case, and also briefly discuss another special case, namely von
Neumann algebras.
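
The four properties listed above can be checked numerically on a toy commutative example. The sketch below is only an illustration under stated assumptions: the phase space is a hypothetical six-point set with the uniform measure, $T$ is a rotation, and bounded functions are represented as Python lists of complex numbers.

```python
N_POINTS = 6  # hypothetical finite phase space X = {0, ..., 5}, uniform measure


def T(x):
    """Rotation of the finite phase space; it preserves the uniform measure."""
    return (x + 1) % N_POINTS


def phi(f):
    """phi(f): the integral of f with respect to the uniform measure."""
    return sum(f) / N_POINTS


def tau(f):
    """tau(f) = f o T."""
    return [f[T(x)] for x in range(N_POINTS)]


def star_times(f, g):
    """Pointwise product f* g, where f* is the complex conjugate of f."""
    return [complex(a).conjugate() * b for a, b in zip(f, g)]


one = [1] * N_POINTS
f = [1 + 2j, 0, -3, 2j, 5, 1 - 1j]

print(phi(one))                                # phi(1) = 1
print(phi(star_times(f, f)).real)              # phi(f* f) >= 0
print(tau(one) == one)                         # tau(1) = 1
print(phi(star_times(tau(f), tau(f))).real)    # at most phi(f* f)
```

Nothing in the check uses commutativity of the pointwise product, which is the observation that motivates replacing $B_\infty(\Sigma)$ by an arbitrary unital *-algebra.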

In Section 2.7 an alternative approach to recurrence is described where $\varphi$ is not
required to be linear (which precludes the use of the GNS-construction), and can
even assume values in a unital C*-algebra. Section 2.7 is independent from the rest
of the work in this chapter.

2.1.2 Remark. In Chapter 1 the observable algebra of a physical system was
assumed to be a unital C*-algebra, rather than merely a unital *-algebra. This
assumption is not restrictive, since the representations $\mathfrak{L}(\mathfrak{H})$ and $B_\infty(\Sigma)$, and also
any von Neumann algebra, are indeed C*-algebras. In the general structure of
mechanics given by (i)-(iv) of Section 1.4 (in other words the abstract probabilistic
description of noncommutative information; see 1.6.1) we can take the observable
algebra $\mathfrak{A}$ as merely a unital *-algebra without losing any of the ideas involved. But
for more specific topics we need more structure, for example in the quantum analogue
of Liouville's Theorem described in Section 1.7, where a finite von Neumann algebra
is used as the observable algebra. Also, in the GNS-construction, used in Section
1.2, a C*-algebra delivers more than a mere *-algebra (see Remark 2.2.3). However,
in this chapter we will use as few assumptions as possible to build the theory, and
in Sections 2.2 to 2.5 we only need unital *-algebras. ■

2.2 Cyclic representations 

By a state on a unital *-algebra $\mathfrak{A}$ we mean a linear functional $\varphi$ on $\mathfrak{A}$ which is
positive (i.e. $\varphi(A^*A) \ge 0$ for all $A \in \mathfrak{A}$) with $\varphi(1) = 1$. Let $L(V)$ denote the
algebra of all linear operators $V \to V$ on the vector space $V$.

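2.2.1 Definition. Let $\varphi$ be a state on a unital *-algebra $\mathfrak{A}$. A cyclic
representation of $(\mathfrak{A}, \varphi)$ is a triple $(\mathfrak{G}, \pi, \Omega)$, where $\mathfrak{G}$ is an inner product space,
$\pi : \mathfrak{A} \to L(\mathfrak{G})$ is linear with $\pi(1) = 1$, $\pi(AB) = \pi(A)\pi(B)$, $\Omega \in \mathfrak{G}$, $\pi(\mathfrak{A})\Omega = \mathfrak{G}$,
and $\langle \pi(A)\Omega, \pi(B)\Omega \rangle = \varphi(A^*B)$, for all $A, B \in \mathfrak{A}$.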

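A cyclic representation as in Definition 2.2.1 exists by the GNS-construction
(given below), but we will not actually need the property $\pi(AB) = \pi(A)\pi(B)$ in
this chapter. The term "cyclic" refers to the fact that $\pi(\mathfrak{A})\Omega = \mathfrak{G}$. Note that

$$\iota : \mathfrak{A} \to \mathfrak{G} : A \mapsto \pi(A)\Omega \tag{2.1}$$

is a linear surjection such that $\iota(1) = \Omega$. Also, $\|\Omega\|^2 = \varphi(1^*1) = 1$. We define a
seminorm $\|\cdot\|_\varphi$ on $\mathfrak{A}$ by

$$\|A\|_\varphi := \sqrt{\varphi(A^*A)} = \|\iota(A)\|$$

for all $A \in \mathfrak{A}$.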

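2.2.2 The GNS-construction. Let $\varphi : \mathfrak{A} \to \mathbb{C}$ be a positive linear functional on
a *-algebra $\mathfrak{A}$.

(i) Then there exists an inner product space $\mathfrak{G}$, a linear surjection $\iota : \mathfrak{A} \to \mathfrak{G}$,
and a linear mapping $\pi : \mathfrak{A} \to L(\mathfrak{G})$ such that

$$\langle \iota(A), \iota(B) \rangle = \varphi(A^*B)$$

$$\pi(A)\iota(B) = \iota(AB)$$

and

$$\pi(AB) = \pi(A)\pi(B)$$

for all $A, B \in \mathfrak{A}$.

(ii) Now assume that $\mathfrak{A}$ is unital, and set $\Omega := \iota(1)$. From (i) it then follows
that

$$\pi(A)\Omega = \iota(A)$$

$$\pi(1) = 1$$

and

$$\langle \pi(A)\Omega, \pi(B)\Omega \rangle = \varphi(A^*B) = \langle \Omega, \pi(A^*B)\Omega \rangle$$

for all $A, B \in \mathfrak{A}$. In particular

$$\varphi(A) = \langle \Omega, \pi(A)\Omega \rangle$$

for all $A \in \mathfrak{A}$.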

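Proof. We have to construct $\mathfrak{G}$, $\iota$ and $\pi$. This construction is called the
Gelfand-Naimark-Segal (GNS) construction.

(i) Consider the vector subspace $\mathfrak{J} := \{A \in \mathfrak{A} : \|A\|_\varphi = 0\}$ of $\mathfrak{A}$. Note that $\mathfrak{J}$ is
indeed a vector space, since for $A, B \in \mathfrak{J}$ we have

$$\begin{aligned}
\|A + B\|_\varphi^2 &= \|A\|_\varphi^2 + \varphi(A^*B) + \varphi(B^*A) + \|B\|_\varphi^2 \\
&\le |\varphi(A^*B)| + |\varphi(B^*A)| \\
&\le \|A\|_\varphi \|B\|_\varphi + \|B\|_\varphi \|A\|_\varphi \\
&= 0
\end{aligned}$$

by the Cauchy-Schwarz inequality ([BR, Lemma 2.3.10]). Then $\mathfrak{G} := \mathfrak{A}/\mathfrak{J}$ is also a
vector space, on which we can define an inner product by

$$\langle \iota(A), \iota(B) \rangle := \varphi(A^*B)$$

where $\iota : \mathfrak{A} \to \mathfrak{G}$ is defined by

$$\iota(A) := A + \mathfrak{J}$$

for all $A \in \mathfrak{A}$. Note that $\iota$ is a surjection by definition, and that it is linear. We
show that this inner product is well-defined: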

Say $\iota(C) = \iota(A)$ and $\iota(D) = \iota(B)$, and set $I := C - A$ and $J := D - B$. Then

$$\varphi(C^*D) = \varphi(A^*B) + \varphi(A^*J) + \varphi(I^*B) + \varphi(I^*J)$$

but $|\varphi(A^*J)| \le \|A\|_\varphi \|J\|_\varphi = 0$ by the Cauchy-Schwarz inequality, since $J \in \mathfrak{J}$.
Similarly $\varphi(I^*B) = \varphi(I^*J) = 0$, hence $\varphi(C^*D) = \varphi(A^*B)$, proving that the inner
product is well-defined.

That $\langle \cdot, \cdot \rangle$ is indeed an inner product on $\mathfrak{G}$ follows from the definitions given,
and the fact that

$$\overline{\langle \iota(A), \iota(B) \rangle} = \overline{\varphi(A^*B)} = \varphi(B^*A) = \langle \iota(B), \iota(A) \rangle$$

(see [BR, Lemma 2.3.10]).
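Define $\pi : \mathfrak{A} \to L(\mathfrak{G})$ by

$$\pi(A)\iota(B) = \iota(AB).$$

$\pi(A)$ is a well-defined element of $L(\mathfrak{G})$, since $\iota$ is a linear surjection, and if $\iota(C) =
\iota(B)$, then $I := C - B \in \mathfrak{J}$, and therefore by the Cauchy-Schwarz inequality

$$\|AI\|_\varphi^2 = |\varphi((A^*AI)^*I)| \le \|A^*AI\|_\varphi \|I\|_\varphi = 0$$

which means that $AI \in \mathfrak{J}$, i.e. $\mathfrak{J}$ is a left ideal of $\mathfrak{A}$, and this in turn implies that
$\iota(AC) = \iota(AB) + \iota(AI) = \iota(AB) + \mathfrak{J} = \iota(AB)$, since $\mathfrak{J}$ is the zero element of $\mathfrak{G}$.
Since $\iota$ is linear, so is $\pi$. Also note that for any $A, B, C \in \mathfrak{A}$,

$$\pi(AB)\iota(C) = \iota(ABC) = \pi(A)\iota(BC) = \pi(A)\pi(B)\iota(C)$$

so $\pi(AB) = \pi(A)\pi(B)$.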

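(ii) By (i) we have $\pi(A)\Omega = \pi(A)\iota(1) = \iota(A1) = \iota(A)$ and $\pi(1)\iota(A) = \iota(1A) =
\iota(A)$ for all $A$. Since $\iota$ is surjective, it follows that $\pi(\mathfrak{A})\Omega = \iota(\mathfrak{A}) = \mathfrak{G}$ and $\pi(1) = 1$.
Furthermore,

$$\begin{aligned}
\langle \pi(A)\Omega, \pi(B)\Omega \rangle &= \langle \iota(A), \iota(B) \rangle = \varphi(A^*B) \\
&= \varphi(1^*(A^*B)) \\
&= \langle \iota(1), \iota(A^*B) \rangle \\
&= \langle \Omega, \pi(A^*B)\Omega \rangle .
\end{aligned}$$

In particular, setting $A = 1$, we have $\varphi(B) = \langle \Omega, \pi(B)\Omega \rangle$. ■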

2.2.3 Remark. If $\mathfrak{A}$ in 2.2.2 is a C*-algebra, then we can replace $L(\mathfrak{G})$ by $\mathfrak{L}(\mathfrak{G})$,
and using this boundedness, each $\pi(A)$ can be uniquely extended to an element of
$\mathfrak{L}(\mathfrak{H})$, where $\mathfrak{H}$ is the completion of $\mathfrak{G}$. This is what was used in Section 1.2. See
[BR, Section 2.3.3] for details. ■
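
The construction can be made concrete in the simplest noncommutative case. The following is a minimal sketch, assuming the algebra is the 2 x 2 complex matrices and the state is the normalized trace (the pair that reappears in Example 2.5.7). Since this state is faithful, the null space $\mathfrak{J}$ is $\{0\}$, so the quotient map $\iota$ is the identity and $\pi$ acts by left multiplication. All function names are hypothetical.

```python
def matmul(A, B):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def adjoint(A):
    """Conjugate transpose, the involution on the 2x2 matrices."""
    return [[complex(A[j][i]).conjugate() for j in range(2)] for i in range(2)]


def phi(A):
    """Normalized trace; a faithful state, so the null space J is {0}."""
    return (A[0][0] + A[1][1]) / 2


def iota(A):
    """Quotient map A -> A + J; the identity here because J = {0}."""
    return A


def inner(x, y):
    """Inner product <iota(A), iota(B)> := phi(A* B)."""
    return phi(matmul(adjoint(x), y))


def pi(A):
    """Representation by left multiplication: pi(A) iota(B) = iota(AB)."""
    return lambda x: matmul(A, x)


ONE = [[1, 0], [0, 1]]
OMEGA = iota(ONE)  # cyclic vector Omega = iota(1)

A = [[1 + 2j, 3], [0, -1j]]
B = [[2, 1j], [1, 4]]

# <pi(A) Omega, pi(B) Omega> should equal phi(A* B).
lhs = inner(pi(A)(OMEGA), pi(B)(OMEGA))
rhs = phi(matmul(adjoint(A), B))
print(abs(lhs - rhs) < 1e-12)

# phi(A) should equal <Omega, pi(A) Omega>.
print(abs(phi(A) - inner(OMEGA, pi(A)(OMEGA))) < 1e-12)
```

For a state that is not faithful, `iota` would have to pass to cosets modulo $\mathfrak{J}$, which this sketch does not attempt.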

2.3 *-dynamical systems and ergodicity

Motivated by our remarks in Section 2.1, we give the following definition: 

2.3.1 Definition. Let $\varphi$ be a state on a unital *-algebra $\mathfrak{A}$. Consider any linear
function $\tau : \mathfrak{A} \to \mathfrak{A}$ such that

$$\tau(1) = 1$$

and

$$\varphi(\tau(A)^*\tau(A)) \le \varphi(A^*A)$$

for all $A \in \mathfrak{A}$. Then we call $(\mathfrak{A}, \varphi, \tau)$ a *-dynamical system.

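Note that for $\tau$ as in Definition 2.3.1 and $\iota$ given by equation (2.1),

$$U_0 : \mathfrak{G} \to \mathfrak{G} : \iota(A) \mapsto \iota(\tau(A)) \tag{3.1}$$

is a well-defined linear operator with $\|U_0\| \le 1$, since $\|\iota(\tau(A))\|^2 = \varphi(\tau(A)^*\tau(A)) \le
\varphi(A^*A) = \|\iota(A)\|^2$.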

We now want to define the concept of ergodicity for a *-dynamical system. 

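2.3.2 Definition. A *-dynamical system $(\mathfrak{A}, \varphi, \tau)$ is called ergodic if it has the
following property: For any sequence $(A_n)$ in $\mathfrak{A}$ such that $\|\tau(A_n) - A_n\|_\varphi \to 0$ and
such that for any $\varepsilon > 0$ there exists an $N \in \mathbb{N}$ for which $\|A_m - A_n\|_\varphi < \varepsilon$ if $m > N$
and $n > N$, it follows that $\|A_n - \alpha\|_\varphi \to 0$ for some $\alpha \in \mathbb{C}$.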

In Section 2.5 we will give a simple example of an ergodic *-dynamical system
whose *-algebra is noncommutative. Recall that for any vectors $x$ and $y$ in a Hilbert
space $\mathfrak{H}$, we denote by $x \otimes y$ the bounded linear operator on $\mathfrak{H}$ defined by $(x \otimes y)z =
x \langle y, z \rangle$. The motivation for Definition 2.3.2 is the following proposition:

2.3.3 Proposition. Consider a *-dynamical system $(\mathfrak{A}, \varphi, \tau)$ and let $U_0$ be given by
(3.1) in terms of any cyclic representation of $(\mathfrak{A}, \varphi)$. Let $U : \mathfrak{H} \to \mathfrak{H}$ be the bounded
linear extension of $U_0$ to the completion $\mathfrak{H}$ of $\mathfrak{G}$, and let $P$ be the projection of
$\mathfrak{H}$ onto the subspace of fixed points of $U$. Then $(\mathfrak{A}, \varphi, \tau)$ is ergodic if and only
if $P = \Omega \otimes \Omega$, or equivalently, if and only if the fixed points of $U$ form a
one-dimensional subspace of $\mathfrak{H}$.

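Proof. Since $\|\Omega\|^2 = \varphi(1^*1) = 1$, we know that $\Omega \otimes \Omega$ is the projection of $\mathfrak{H}$ onto
the one-dimensional subspace $\mathbb{C}\Omega$. Also note that $U\Omega = \Omega$, since $\Omega = \iota(1)$, hence
$\mathbb{C}\Omega \subset P\mathfrak{H}$.

Suppose $(\mathfrak{A}, \varphi, \tau)$ is ergodic and let $x$ be a fixed point of $U$. Consider any
sequence $(x_n)$ in $\mathfrak{G}$ such that $x_n \to x$, say $x_n = \iota(A_n)$. Then $\|\tau(A_n) - A_n\|_\varphi =
\|Ux_n - x_n\| \to 0$, since $U$ is continuous, while for any $\varepsilon > 0$ there exists some $N$
for which $\|A_m - A_n\|_\varphi = \|x_m - x_n\| < \varepsilon$ if $m > N$ and $n > N$. Since $(\mathfrak{A}, \varphi, \tau)$ is
ergodic, it follows that $\|x_n - \iota(\alpha)\| = \|A_n - \alpha\|_\varphi \to 0$ for some $\alpha \in \mathbb{C}$, but then
$x = \iota(\alpha) = \alpha\Omega$. Therefore $P\mathfrak{H} = \mathbb{C}\Omega$, which means that $P = \Omega \otimes \Omega$.

Conversely, suppose $P = \Omega \otimes \Omega$ and consider any sequence $(A_n)$ in $\mathfrak{A}$ such that
$\|\tau(A_n) - A_n\|_\varphi \to 0$ and such that for any $\varepsilon > 0$ there exists some $N$ for which
$\|A_m - A_n\|_\varphi < \varepsilon$ if $m > N$ and $n > N$. Then $x_n := \iota(A_n)$ is a Cauchy sequence
and hence convergent in $\mathfrak{H}$, since $\|x_m - x_n\| = \|A_m - A_n\|_\varphi$. Say $x_n \to x$, then
$Ux_n \to Ux$ since $U$ is continuous. Since $\|Ux_n - x_n\| = \|\tau(A_n) - A_n\|_\varphi \to 0$, it
follows that $Ux_n \to x$, hence $Ux = x$. This means that $x \in P\mathfrak{H}$, which implies
that $x = \alpha\Omega$ for some $\alpha \in \mathbb{C}$. Therefore $\|A_n - \alpha\|_\varphi = \|x_n - \alpha\Omega\| \to 0$, and so we
conclude that $(\mathfrak{A}, \varphi, \tau)$ is ergodic. ■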

Proposition 2.3.3 tells us that Definition 2.3.2 includes the measure theoretic
definition as a special case. This can be seen as follows: From a measure theoretic
dynamical system $(X, \Sigma, \mu, T)$ we obtain the *-dynamical system $(B_\infty(\Sigma), \varphi, \tau)$,
where $\varphi(f) = \int f \, d\mu$ and $\tau(f) = f \circ T$ for all $f \in B_\infty(\Sigma)$. A cyclic representation
of $(B_\infty(\Sigma), \varphi, \tau)$ is $(\mathfrak{G}, \pi, \Omega)$ with $\mathfrak{G} = \{[g] : g \in B_\infty(\Sigma)\}$, $\pi(f)[g] = [fg]$ for all
$f, g \in B_\infty(\Sigma)$, and $\Omega = [1]$, where $[g]$ denotes the equivalence class of all measurable
complex-valued functions on the measure space that are almost everywhere equal to
$g$. Note that $\iota$, defined by equation (2.1), now becomes $\iota(f) = [f]$. The completion
of $\mathfrak{G}$ is $L^2(\mu)$ by the following:

2.3.4 Proposition. Let $\mu$ be a measure on a $\sigma$-algebra $\Sigma$ of subsets of a set $X$.
Then $\mathfrak{G} := \{[g] : g \in B_\infty(\Sigma)\}$ is dense in $L^2(\mu)$.

Proof. For any $\Sigma$-measurable $g : X \to \mathbb{R}$ with $g \ge 0$, we know that a sequence
of simple $\Sigma$-measurable functions $s_n$ exists such that $0 \le s_1 \le s_2 \le \dots \le g$ and
$s_n(x) \to g(x)$ for all $x \in X$ (see [Rud, Theorem 1.17]). So $|s_n(x) - g(x)|^2 \to 0$
for all $x \in X$, while of course $s_n \in B_\infty(\Sigma)$, and so $[s_n] \in \mathfrak{G}$, for all $n$. Clearly
$|s_n - g|^2 \le |g|^2$, so if we assume that $[g] \in L^2(\mu)$, then $|g|^2 \in L^1(\mu)$, and we
conclude by Lebesgue's Dominated Convergence Theorem [Rud, 1.34] that

$$\lim_{n \to \infty} \int |s_n - g|^2 \, d\mu = 0$$

which means that $[g]$ is contained in the closure of $\mathfrak{G}$ in $L^2(\mu)$. For an arbitrary
$[g] \in L^2(\mu)$, we have the standard representation $g = u^+ - u^- + iv^+ - iv^-$
where $u^+, u^-, v^+, v^- \ge 0$ are $\Sigma$-measurable ([Rud, 1.9(b) and 1.14(b)]). Note that
$[u^+], [u^-], [v^+], [v^-] \in L^2(\mu)$, for example $0 \le u^+ \le u^+ + u^- = |u| \le |g|$ where
$u = u^+ - u^-$. Since $[u^+], [u^-], [v^+], [v^-]$ are then contained in the closure of $\mathfrak{G}$, so is
$[g] = [u^+] - [u^-] + i[v^+] - i[v^-]$. ■

The operator $U$ in Proposition 2.3.3 is now given by $U[f] = [f \circ T]$ or, dropping
the $[\cdot]$ notation as is standard for $L^2$-spaces,

$$Uf = f \circ T$$

for all $f \in L^2(\mu)$, where $f$ and $f \circ T$ now denote equivalence classes of functions.
Proposition 2.3.3 tells us that $(B_\infty(\Sigma), \varphi, \tau)$ is ergodic if and only if the fixed points of
$U$ form a one-dimensional subspace of $L^2(\mu)$, in other words if and only if $(X, \Sigma, \mu, T)$
is ergodic, as was mentioned in Section 2.1.

Finally we remark that we use Definition 2.3.2 as the definition of ergodicity,
since it is formulated purely in terms of the objects $\mathfrak{A}$, $\varphi$ and $\tau$ appearing in the
*-dynamical system $(\mathfrak{A}, \varphi, \tau)$, unlike Proposition 2.3.3 which involves a cyclic
representation of these objects. However, as a characterization of ergodicity,
Proposition 2.3.3 is generally easier to use. Of course, one might wonder if Definition
2.3.2 could not be simplified by using a single element rather than a sequence.
With $U$ as in Proposition 2.3.3, and $x = \iota(A)$ for some $A \in \mathfrak{A}$, we have $Ux = x$
if and only if $\|Ux - x\| = 0$, which is equivalent to $\|\tau(A) - A\|_\varphi = 0$. For
ergodicity we need this to imply that $x = \alpha\Omega$ for some $\alpha \in \mathbb{C}$, which is
equivalent to $\|A - \alpha\|_\varphi = \|x - \alpha\Omega\| = 0$. However, we cannot define ergodicity as
"$\|\tau(A) - A\|_\varphi = 0$ implies that $\|A - \alpha\|_\varphi = 0$ for some $\alpha \in \mathbb{C}$", since
Proposition 2.3.3 would no longer hold: There would be examples of ergodic *-dynamical
systems for which the fixed points of $U$ do not form a one-dimensional subspace of $\mathfrak{H}$.
(In Appendix A.1 we give such an example.) Our theory would then fall apart, since
much of our later work is based on the fact that for ergodic systems the fixed point
space of $U$ is one-dimensional. For example, the characterization of ergodicity in
terms of the equality of means of the sort mentioned in Section 2.1 (but extended to
*-dynamical systems) implies this one-dimensionality. Also, this one-dimensionality
is used in our proof of the variation of Khintchine's Theorem mentioned in Section
2.1. (See Sections 2.4 and 2.5 for details.) The use of a sequence rather than a single
element is therefore necessary in Definition 2.3.2.

2.4 Some ergodic theory in Hilbert spaces 

Our main tool in this section is the following: 

2.4.1 The Mean Ergodic Theorem. Consider a linear operator $U : \mathfrak{H} \to \mathfrak{H}$ with
$\|U\| \le 1$ on a Hilbert space $\mathfrak{H}$. Let $P$ be the projection of $\mathfrak{H}$ onto the subspace of
fixed points of $U$. For any $x \in \mathfrak{H}$ we then have

$$\frac{1}{n} \sum_{k=0}^{n-1} U^k x \to Px$$

as $n \to \infty$.

Refer to [Pete] for a proof. We now state and prove a generalized Hilbert space
version of Khintchine's Theorem:

2.4.2 Theorem. Let $\mathfrak{H}$, $U$ and $P$ be as in the Mean Ergodic Theorem above.
Consider any $x, y \in \mathfrak{H}$ and $\varepsilon > 0$. Then the set

$$E = \{k \in \mathbb{N} : |\langle x, U^k y \rangle| > |\langle x, Py \rangle| - \varepsilon\}$$

is relatively dense in $\mathbb{N}$.



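Proof. The proof is essentially the same as that of Khintchine's Theorem. By the
Mean Ergodic Theorem there exists an $n \in \mathbb{N}$ such that

$$\left\| \frac{1}{n} \sum_{k=0}^{n-1} U^k y - Py \right\| < \frac{\varepsilon}{\|x\| + 1}.$$

Since $UPy = Py$ and $\|U\| \le 1$, it follows for any $j \in \mathbb{N}$ that

$$\left\| \frac{1}{n} \sum_{k=j}^{j+n-1} U^k y - Py \right\| \le \left\| \frac{1}{n} \sum_{k=0}^{n-1} U^k y - Py \right\| < \frac{\varepsilon}{\|x\| + 1}$$

and therefore

$$\left| \left\langle x, \frac{1}{n} \sum_{k=j}^{j+n-1} U^k y - Py \right\rangle \right| \le \|x\| \left\| \frac{1}{n} \sum_{k=j}^{j+n-1} U^k y - Py \right\| < \varepsilon.$$

Hence

$$|\langle x, Py \rangle| - \varepsilon < \left| \left\langle x, \frac{1}{n} \sum_{k=j}^{j+n-1} U^k y \right\rangle \right| \le \frac{1}{n} \sum_{k=j}^{j+n-1} |\langle x, U^k y \rangle|$$

and so $|\langle x, U^k y \rangle| > |\langle x, Py \rangle| - \varepsilon$ for some $k \in \{j, j+1, \dots, j+n-1\}$, in other words
$E$ is relatively dense in $\mathbb{N}$. ■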
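
Both the Mean Ergodic Theorem and Theorem 2.4.2 can be observed numerically in a two-dimensional toy case. The sketch below assumes a hypothetical operator $U = \mathrm{diag}(1, c)$ on $\mathbb{C}^2$ with $|c| = 1$ and $c \ne 1$, so that $\|U\| \le 1$ and $P$ projects onto the first coordinate. The vectors and the value of $\varepsilon$ are arbitrary illustrative choices.

```python
import cmath

C = cmath.exp(0.7j)  # hypothetical eigenvalue with |C| = 1 and C != 1


def U_power(k, x):
    """Apply U^k, where U = diag(1, C); the fixed points of U are span{e_1}."""
    return (x[0], C**k * x[1])


def P(x):
    """Projection onto the fixed points of U."""
    return (x[0], 0j)


def inner(x, y):
    """Inner product, conjugate linear in the first argument."""
    return sum(a.conjugate() * b for a, b in zip(x, y))


def norm(x):
    return abs(inner(x, x)) ** 0.5


def cesaro_error(n, x):
    """|| (1/n) * sum_{k=0}^{n-1} U^k x  -  P x ||."""
    terms = [U_power(k, x) for k in range(n)]
    mean = tuple(sum(t[i] for t in terms) / n for i in range(2))
    Px = P(x)
    return norm((mean[0] - Px[0], mean[1] - Px[1]))


x = (1 + 0j, 1 + 0j)
y = (1 + 0j, 1 + 0j)

for n in (10, 100, 1000):
    print(n, cesaro_error(n, y))  # shrinks as n grows

eps = 0.1
E = [k for k in range(1, 201)
     if abs(inner(x, U_power(k, y))) > abs(inner(x, P(y))) - eps]
print(E[:20])
```

For this particular choice the set `E` omits some values of `k`, but never more than a few in a row, which is what relative denseness asserts.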

Khintchine's Theorem corresponds to the case where y = x (see Theorem 2.5.1). 
The following two propositions are the Hilbert space building blocks for two char- 
acterizations of ergodicity to be considered in the next section. 

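2.4.3 Proposition. Let $\mathfrak{H}$, $U$ and $P$ be as in the Mean Ergodic Theorem above.
Consider an $\Omega \in \mathfrak{H}$ and let $\mathfrak{T}$ be any total set in $\mathfrak{H}$. Then the following hold:
(i) If $P = \Omega \otimes \Omega$, then

$$\left\| \frac{1}{n} \sum_{k=0}^{n-1} U^k y - \Omega \langle \Omega, y \rangle \right\| \to 0 \tag{4.1}$$

as $n \to \infty$, for every $y \in \mathfrak{H}$.

(ii) If (4.1) holds for every $y \in \mathfrak{T}$, then $P = \Omega \otimes \Omega$.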

Proof. By the Mean Ergodic Theorem we know that

$$\left\| \frac{1}{n} \sum_{k=0}^{n-1} U^k y - Py \right\| \to 0 \tag{4.2}$$

for every $y \in \mathfrak{H}$ as $n \to \infty$, but for $P = \Omega \otimes \Omega$ we have $Py = \Omega \langle \Omega, y \rangle$ and this
proves (i).

To prove (ii), consider any $y \in \mathfrak{T}$. From (4.1) and (4.2) it then follows that
$Py = \Omega \langle \Omega, y \rangle = (\Omega \otimes \Omega)y$. Since by definition the linear span of $\mathfrak{T}$ is dense in $\mathfrak{H}$,
and since $P$ and $\Omega \otimes \Omega$ are bounded (and hence continuous) linear operators on $\mathfrak{H}$,
we conclude that $P = \Omega \otimes \Omega$. ■

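2.4.4 Proposition. Let $\mathfrak{H}$, $U$ and $P$ be as in the Mean Ergodic Theorem above.
Consider an $\Omega \in \mathfrak{H}$ and let $\mathfrak{S}$ and $\mathfrak{T}$ be total sets in $\mathfrak{H}$. Then the following hold:

(i) If $P = \Omega \otimes \Omega$, then

$$\frac{1}{n} \sum_{k=0}^{n-1} \langle x, U^k y \rangle \to \langle x, \Omega \rangle \langle \Omega, y \rangle \tag{4.3}$$

as $n \to \infty$, for all $x, y \in \mathfrak{H}$.

(ii) If (4.3) holds for all $x \in \mathfrak{S}$ and $y \in \mathfrak{T}$, then $P = \Omega \otimes \Omega$.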

Proof. Statement (i) follows immediately from Proposition 2.4.3(i) by simply taking
the inner product of $x$ with the expression inside the norm in (4.1).

To prove (ii), consider any $x \in \mathfrak{S}$ and $y \in \mathfrak{T}$. From the Mean Ergodic Theorem
it follows that

$$\frac{1}{n} \sum_{k=0}^{n-1} \langle x, U^k y \rangle \to \langle x, Py \rangle$$

as $n \to \infty$. Combining this with (4.3) we see that $\langle x, Py \rangle = \langle x, \Omega \rangle \langle \Omega, y \rangle =
\langle x, (\Omega \otimes \Omega)y \rangle$. Since the linear span of $\mathfrak{S}$ is dense in $\mathfrak{H}$, this implies that $Py =
(\Omega \otimes \Omega)y$. Hence $P = \Omega \otimes \Omega$ as in the proof of Proposition 2.4.3(ii). ■

The reason for using total sets will become clear in Sections 2.5 and 2.6. 

2.5 Ergodic results for *-dynamical systems

In this section we carry the results of Section 2.4 over to *-dynamical systems using
cyclic representations. Firstly we give a *-dynamical generalization of Khintchine's
Theorem which follows from Theorem 2.4.2:

2.5.1 Theorem. Let $(\mathfrak{A}, \varphi, \tau)$ be a *-dynamical system, and consider any $A \in \mathfrak{A}$
and $\varepsilon > 0$. Then the set

$$E = \{k \in \mathbb{N} : |\varphi(A^*\tau^k(A))| > |\varphi(A)|^2 - \varepsilon\}$$

is relatively dense in $\mathbb{N}$.
Proof. Let $U$ and $P$ be defined as in Proposition 2.3.3 in terms of any cyclic
representation of $(\mathfrak{A}, \varphi)$. Set $x = \iota(A)$. From equation (3.1) it is clear that $\Omega = \iota(1)$ is a
fixed point of $U$, so $\langle \Omega, x \rangle = \langle P\Omega, x \rangle = \langle \Omega, Px \rangle$. It follows that $|\varphi(A)| = |\varphi(1^*A)| =
|\langle \Omega, x \rangle| \le \|\Omega\| \|Px\| = \|Px\|$, so that $|\langle x, Px \rangle| = \|Px\|^2 \ge |\varphi(A)|^2$. We also have
$\varphi(A^*\tau^k(A)) = \langle x, U^k x \rangle$. Hence by Theorem 2.4.2, with $y = x$, the set $E$ is relatively
dense in $\mathbb{N}$. ■

A C*-algebraic version of Theorem 2.5.1 was previously obtained in [NSZ]. Next
we use Theorem 2.4.2 to prove a variant of Theorem 2.5.1:

2.5.2 Theorem. Let $(\mathfrak{A}, \varphi, \tau)$ be an ergodic *-dynamical system, and consider any
$A, B \in \mathfrak{A}$ and $\varepsilon > 0$. Then the set

$$E = \{k \in \mathbb{N} : |\varphi(A\tau^k(B))| > |\varphi(A)\varphi(B)| - \varepsilon\}$$

is relatively dense in $\mathbb{N}$.

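Proof. Let $U$ and $P$ be defined as in Proposition 2.3.3 in terms of any cyclic
representation of $(\mathfrak{A}, \varphi)$. Set $x = \iota(A^*)$ and $y = \iota(B)$. By Proposition 2.3.3 we have
$Px = \alpha\Omega$ and $Py = \beta\Omega$ where $\alpha = \langle \Omega, x \rangle = \overline{\langle x, \Omega \rangle} = \overline{\varphi(A^{**}1)} = \overline{\varphi(A)}$ and similarly
$\beta = \varphi(B)$. Therefore $|\langle x, Py \rangle| = |\langle Px, Py \rangle| = |\overline{\alpha}\beta| = |\varphi(A)\varphi(B)|$. Furthermore,
$\varphi(A\tau^k(B)) = \langle x, U^k y \rangle$. Hence $E$ is relatively dense in $\mathbb{N}$ by Theorem 2.4.2. ■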

We are now going to prove two characterizations of ergodicity using Propositions
2.4.3 and 2.4.4 respectively. But first we need to consider a notion of totality of a set
in a unital *-algebra. (Remember that an abstract unital *-algebra has no norm.)

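2.5.3 Definition. Let $\varphi$ be a state on a unital *-algebra $\mathfrak{A}$. A subset $\mathfrak{T}$ of $\mathfrak{A}$ is
called $\varphi$-dense in $\mathfrak{A}$ if it is dense in the seminormed space $(\mathfrak{A}, \|\cdot\|_\varphi)$. A subset $\mathfrak{T}$
of $\mathfrak{A}$ is called $\varphi$-total in $\mathfrak{A}$ if the linear span of $\mathfrak{T}$ is $\varphi$-dense in $\mathfrak{A}$.

Trivially, a unital *-algebra is $\varphi$-total in itself for any state $\varphi$.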

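2.5.4 Lemma. Let $\varphi$ be a state on a unital *-algebra $\mathfrak{A}$, and consider any subset
$\mathfrak{T}$ of $\mathfrak{A}$. Let $\iota$ be given by (2.1) in terms of any cyclic representation of $(\mathfrak{A}, \varphi)$, and
let $\mathfrak{H}$ be the completion of $\mathfrak{G}$. Then $\mathfrak{T}$ is $\varphi$-total in $\mathfrak{A}$ if and only if $\iota(\mathfrak{T})$ is total in $\mathfrak{H}$.

Proof. Suppose $\mathfrak{T}$ is $\varphi$-total in $\mathfrak{A}$, that is to say the linear span $\mathfrak{B}$ of $\mathfrak{T}$ is $\varphi$-dense
in $\mathfrak{A}$. Then $\iota(\mathfrak{B})$ is dense in $\mathfrak{G} = \iota(\mathfrak{A})$, since for any $A \in \mathfrak{A}$ there exists a sequence
$(A_n)$ in $\mathfrak{B}$ such that $\|\iota(A_n) - \iota(A)\| = \|A_n - A\|_\varphi \to 0$. But by definition $\mathfrak{G}$ is dense
in $\mathfrak{H}$, hence $\iota(\mathfrak{B})$ is dense in $\mathfrak{H}$. Since $\iota$ is linear, this means that $\iota(\mathfrak{T})$ is total in $\mathfrak{H}$.

Conversely, suppose $\iota(\mathfrak{T})$ is total in $\mathfrak{H}$, then $\iota(\mathfrak{B})$ is dense in $\mathfrak{H}$. It follows that
$\mathfrak{B}$ is $\varphi$-dense in $\mathfrak{A}$, since for any $A \in \mathfrak{A}$ there exists a sequence $(A_n)$ in $\mathfrak{B}$ such that
$\|A_n - A\|_\varphi = \|\iota(A_n) - \iota(A)\| \to 0$. In other words, $\mathfrak{T}$ is $\varphi$-total in $\mathfrak{A}$. ■
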
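2.5.5 Proposition. Let $(\mathfrak{A}, \varphi, \tau)$ be a *-dynamical system, and consider any $\varphi$-total
set $\mathfrak{T}$ in $\mathfrak{A}$. Then the following hold:
(i) If $(\mathfrak{A}, \varphi, \tau)$ is ergodic, then

$$\left\| \frac{1}{n} \sum_{k=0}^{n-1} \tau^k(A) - \varphi(A) \right\|_\varphi \to 0 \tag{5.1}$$

as $n \to \infty$, for every $A \in \mathfrak{A}$.

(ii) If (5.1) holds for every $A \in \mathfrak{T}$, then $(\mathfrak{A}, \varphi, \tau)$ is ergodic.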

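Proof. Let $U$ and $P$ be defined as in Proposition 2.3.3 in terms of any cyclic
representation of $(\mathfrak{A}, \varphi)$. Suppose $(\mathfrak{A}, \varphi, \tau)$ is ergodic. For any $A \in \mathfrak{A}$ we then have

$$\left\| \frac{1}{n} \sum_{k=0}^{n-1} \tau^k(A) - \varphi(A) \right\|_\varphi = \left\| \frac{1}{n} \sum_{k=0}^{n-1} U^k \iota(A) - \iota(\varphi(A)) \right\| \to 0 \tag{5.2}$$

as $n \to \infty$, by Proposition 2.4.3(i) and Proposition 2.3.3, since $\iota(\varphi(A)) = \iota(1)\varphi(A) =
\Omega\varphi(1^*A) = \Omega \langle \Omega, \iota(A) \rangle$. This proves (i).

Now suppose (5.1), and therefore (5.2), hold for every $A \in \mathfrak{T}$. Since $\iota(\mathfrak{T})$ is total
in $\mathfrak{H}$ according to Lemma 2.5.4, it follows from Proposition 2.4.3(ii) and the identity
$\iota(\varphi(A)) = \Omega \langle \Omega, \iota(A) \rangle$, that $P = \Omega \otimes \Omega$. So $(\mathfrak{A}, \varphi, \tau)$ is ergodic by Proposition 2.3.3,
confirming (ii). ■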

In the spirit of the original motivation behind the concept of ergodicity, this
proposition characterizes ergodic *-dynamical systems as those for which the time
mean of each element $A$ of the *-algebra converges in the seminorm $\|\cdot\|_\varphi$ to the
"phase space" mean $\varphi(A)$. A better name for the latter would be the system mean in
this case, since there is no phase space involved. For a measure theoretic dynamical
system $(X, \Sigma, \mu, T)$, the state $\varphi$ is given by $\varphi(f) = \int f \, d\mu$, which is indeed the phase
space mean of $f \in B_\infty(\Sigma)$, where $X$ is the phase space. We will come back to this
in Section 2.6.

For any subset $\mathfrak{S}$ of a *-algebra, we write $\mathfrak{S}^* := \{A^* : A \in \mathfrak{S}\}$.

2.5.6 Proposition. Let $(\mathfrak{A}, \varphi, \tau)$ be a *-dynamical system, and consider any $\varphi$-total
sets $\mathfrak{S}$ and $\mathfrak{T}$ in $\mathfrak{A}$. Then the following hold:
(i) If $(\mathfrak{A}, \varphi, \tau)$ is ergodic, then

$$\frac{1}{n} \sum_{k=0}^{n-1} \varphi(A\tau^k(B)) \to \varphi(A)\varphi(B) \tag{5.3}$$

as $n \to \infty$, for all $A, B \in \mathfrak{A}$.

(ii) If (5.3) holds for all $A \in \mathfrak{S}^*$ and $B \in \mathfrak{T}$, then $(\mathfrak{A}, \varphi, \tau)$ is ergodic.

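Proof. Let $U$ and $P$ be defined as in Proposition 2.3.3 in terms of any cyclic
representation of $(\mathfrak{A}, \varphi)$. Suppose $(\mathfrak{A}, \varphi, \tau)$ is ergodic. Then $P = \Omega \otimes \Omega$ by Proposition
2.3.3, and so by Proposition 2.4.4(i) it follows that

$$\frac{1}{n} \sum_{k=0}^{n-1} \varphi(A\tau^k(B)) = \frac{1}{n} \sum_{k=0}^{n-1} \langle \iota(A^*), U^k \iota(B) \rangle \to \varphi(A)\varphi(B) \tag{5.4}$$

as $n \to \infty$, since $\langle \iota(A^*), \Omega \rangle = \varphi(A)$ and $\langle \Omega, \iota(B) \rangle = \varphi(B)$, as in the proof of
Theorem 2.5.2. This proves (i). (Alternatively, (i) can be derived from Proposition
2.5.5(i) using the Cauchy-Schwarz inequality $|\varphi(AC)| \le \|A^*\|_\varphi \|C\|_\varphi$ with $C =
\frac{1}{n} \sum_{k=0}^{n-1} \tau^k(B) - \varphi(B)$. This is essentially how Proposition 2.4.4(i) was derived from
Proposition 2.4.3(i).)

Now suppose (5.3), and therefore (5.4), hold for all $A \in \mathfrak{S}^*$ and $B \in \mathfrak{T}$. Since
$\iota(\mathfrak{S})$ and $\iota(\mathfrak{T})$ are total in $\mathfrak{H}$ according to Lemma 2.5.4, it follows from Proposition
2.4.4(ii) and the identities $\langle \iota(A^*), \Omega \rangle = \varphi(A)$ and $\langle \Omega, \iota(B) \rangle = \varphi(B)$, that $P = \Omega \otimes \Omega$.
So $(\mathfrak{A}, \varphi, \tau)$ is ergodic by Proposition 2.3.3, confirming (ii). ■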

This characterizes ergodicity in terms of mixing. We now give a simple example
of an ergodic *-dynamical system whose *-algebra is noncommutative:

2.5.7 Example. Let $\mathfrak{A}$ be the unital *-algebra of $2 \times 2$ matrices with entries in $\mathbb{C}$,
the involution being the conjugate transpose. Let $\varphi$ be the normalized trace on $\mathfrak{A}$,
that is to say $\varphi = \frac{1}{2}\mathrm{Tr}$. Define $\tau : \mathfrak{A} \to \mathfrak{A}$ by

$$\tau \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix} = \begin{pmatrix} a_{22} & c_1 a_{12} \\ c_2 a_{21} & a_{11} \end{pmatrix}$$

for some fixed $c_1, c_2 \in \mathbb{C}$ with $|c_1| \le 1$, $|c_2| \le 1$, $c_1 \ne 1$ and $c_2 \ne 1$. The conditions
$|c_1| \le 1$ and $|c_2| \le 1$ are necessary and sufficient for $(\mathfrak{A}, \varphi, \tau)$ to be a *-dynamical
system. Note that for any $c \in \mathbb{C}$ with $|c| \le 1$, it follows from the Mean Ergodic
Theorem 2.4.1 that

$$\frac{1}{n} \sum_{k=0}^{n-1} c^k$$

converges to 0 if $c \ne 1$, and to 1 otherwise. Using this fact and Proposition 2.5.6(ii)
with $\mathfrak{S} = \mathfrak{T} = \mathfrak{A}$ (and some calculations), it can be verified that the conditions
$c_1 \ne 1$ and $c_2 \ne 1$ are necessary and sufficient for $(\mathfrak{A}, \varphi, \tau)$ to be ergodic, assuming
that $|c_1| \le 1$ and $|c_2| \le 1$. See Appendix A.2 for more details, and Appendix B for
a physically motivated example of an ergodic *-dynamical system. ■
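
The time-mean characterization (5.1) can be checked numerically for this example. The sketch below assumes that $\tau$ swaps the two diagonal entries and multiplies $a_{12}$ by $c_1$ and $a_{21}$ by $c_2$, and it uses $\|A\|_\varphi^2 = \frac{1}{2}\sum_{i,j} |a_{ij}|^2$ for the normalized trace. The test matrix and the values of $c_1$ and $c_2$ are arbitrary illustrative choices.

```python
def tau(A, c1, c2):
    """Assumed form of tau: swap the diagonal, scale the off-diagonal entries."""
    (a11, a12), (a21, a22) = A
    return [[a22, c1 * a12], [c2 * a21, a11]]


def phi(A):
    """Normalized trace."""
    return (A[0][0] + A[1][1]) / 2


def seminorm(A):
    """||A||_phi = sqrt(phi(A* A)) = sqrt(half the sum of |a_ij|^2)."""
    return (sum(abs(a) ** 2 for row in A for a in row) / 2) ** 0.5


def time_mean(A, c1, c2, n):
    """(1/n) * sum_{k=0}^{n-1} tau^k(A)."""
    total = [[0j, 0j], [0j, 0j]]
    current = A
    for _ in range(n):
        total = [[total[i][j] + current[i][j] for j in range(2)]
                 for i in range(2)]
        current = tau(current, c1, c2)
    return [[total[i][j] / n for j in range(2)] for i in range(2)]


def distance_to_system_mean(A, c1, c2, n):
    """|| time mean of A - phi(A) * 1 ||_phi, the quantity in (5.1)."""
    M = time_mean(A, c1, c2, n)
    m = phi(A)
    return seminorm([[M[0][0] - m, M[0][1]], [M[1][0], M[1][1] - m]])


A = [[1 + 0j, 2 - 1j], [3j, 5 + 0j]]
for n in (10, 100, 1000):
    print(n,
          distance_to_system_mean(A, 0.5j, -1, n),  # c1, c2 != 1
          distance_to_system_mean(A, 1, -1, n))     # c1 = 1
```

With $c_1 \ne 1$ and $c_2 \ne 1$ the first column of output shrinks towards zero, while with $c_1 = 1$ the entry $a_{12}$ survives in the time mean and the distance stays bounded away from zero.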






2.5.8 Open Problem. As mentioned in Section 2.1, the converse of Theorem 2.5.2
holds in the measure theoretic case. In general the question is as follows (also see
Proposition 2.5.6(ii)): Consider a *-dynamical system $(\mathfrak{A}, \varphi, \tau)$, and $\varphi$-total sets $\mathfrak{S}$
and $\mathfrak{T}$ in $\mathfrak{A}$, such that for every $A \in \mathfrak{S}^*$ and $B \in \mathfrak{T}$ with $\varphi(A) \ne 0$ and $\varphi(B) \ne 0$,
there exists a $k \in \mathbb{N}$ for which $\varphi(A\tau^k(B)) \ne 0$. Is $(\mathfrak{A}, \varphi, \tau)$ necessarily ergodic? ■

2.6 Measure theory and von Neumann algebras 

As was mentioned in Section 2.3, from a measure theoretic dynamical system $(X, \Sigma, \mu, T)$
we obtain the *-dynamical system $(B_\infty(\Sigma), \varphi, \tau)$, where $\varphi(f) = \int f \, d\mu$ and $\tau(f) =
f \circ T$. This allows us to apply the results of Section 2.5 to measure theoretic
dynamical systems. For example, if $(X, \Sigma, \mu, T)$ is ergodic, then we know from Section
2.3 that $(B_\infty(\Sigma), \varphi, \tau)$ is ergodic. Hence for this *-dynamical system Theorem 2.5.2
tells us that for any $A, B \in \Sigma$ and $\varepsilon > 0$, the set

\[ \{ k \in \mathbb{N} : |\varphi(\chi_A \tau^k(\chi_B))| > |\varphi(\chi_A)\varphi(\chi_B)| - \varepsilon \} \]

is relatively dense in $\mathbb{N}$, but this set is exactly the set F from Section 2.1. (Here
$\chi$ denotes characteristic functions, as before.) So we have answered our original
question:

2.6.1 Corollary. Let $(X, \Sigma, \mu, T)$ be an ergodic measure theoretic dynamical system.
Then for any $A, B \in \Sigma$ and $\varepsilon > 0$, the set

\[ F = \{ k \in \mathbb{N} : \mu(A \cap T^{-k}(B)) > \mu(A)\mu(B) - \varepsilon \} \]

is relatively dense in $\mathbb{N}$.

This result says that for every $k \in F$, the set A contains a set $A \cap T^{-k}(B)$ of
measure larger than $\mu(A)\mu(B) - \varepsilon$, which is mapped into B by $T^k$. Using a similar
argument, Khintchine's Theorem follows from Theorem 2.5.1.

Likewise, Propositions 2.5.5 and 2.5.6 can be applied to the measure theoretic
case. For example, Proposition 2.5.5(i) tells us that if $(X, \Sigma, \mu, T)$ is ergodic, then

\[ \int \left| \frac{1}{n} \sum_{k=0}^{n-1} f \circ T^k - \int f \, d\mu \right|^2 d\mu \to 0 \tag{6.1} \]

as $n \to \infty$, for every $f \in B_\infty(\Sigma)$. Note that this result is not pointwise and is
therefore not quite as strong as the usual measure theoretic statement of equality
of the time mean and the phase space mean. This is of course where Birkhoff's
Pointwise Ergodic Theorem comes into play (see for example [Pete]).
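As an illustration of Corollary 2.6.1, the set F can be computed explicitly for a very simple ergodic system. The sketch below is an assumption-laden toy, not taken from the thesis: the space is {0, ..., N-1} with the uniform probability measure, T is the cyclic shift x -> x + 1 (mod N), and the helper names `recurrence_times` and `max_gap` are hypothetical. A bounded gap between consecutive elements of F is what relative density means.

```python
from fractions import Fraction


def recurrence_times(N, A, B, eps, k_max):
    """All k in 1..k_max with mu(A & T^{-k}(B)) > mu(A) * mu(B) - eps,
    for the cyclic shift T(x) = (x + 1) % N and the uniform measure."""
    def mu(S):
        return Fraction(len(S), N)

    hits = []
    for k in range(1, k_max + 1):
        # T^{-k}(B) consists of the points that land in B after k steps.
        preimage = {x for x in range(N) if (x + k) % N in B}
        if mu(A & preimage) > mu(A) * mu(B) - eps:
            hits.append(k)
    return hits


def max_gap(ks):
    """Largest difference between consecutive elements of a sorted list."""
    return max(b - a for a, b in zip(ks, ks[1:]))


F = recurrence_times(6, {0, 1, 2}, {0, 1, 2}, Fraction(1, 100), 24)
print(F, max_gap(F))
```

Here the elements of F repeat with period 6, so the gaps stay bounded, as the corollary predicts for an ergodic system.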
What about the converse? Well, in order to effectively apply Propositions
2.5.5(ii) and 2.5.6(ii) to the measure theoretic case, we need to know what the
measure theoretic significance of a $\varphi$-total set in $B_\infty(\Sigma)$ is. The basic fact we will
use is the following simple proposition:

2.6.2 Proposition. Let $(X, \Sigma, \mu)$ be a probability space and set $\varphi(f) = \int f \, d\mu$ for
all $f \in B_\infty(\Sigma)$. Then the set $\mathfrak{T} = \{\chi_S : S \in \Sigma\}$ is $\varphi$-total in $B_\infty(\Sigma)$.

Proof. The same argument as in the proof of Proposition 2.3.4, keeping in mind that
$\|f\|_\varphi = \left( \int |f|^2 \, d\mu \right)^{1/2}$ for all $f \in B_\infty(\Sigma)$, shows that for any $g \in B_\infty(\Sigma)$ there
is a sequence of simple functions $s_n$ such that $\|s_n - g\|_\varphi \to 0$. However, by definition
a simple function is a linear combination of elements of $\mathfrak{T}$, so we conclude that the
linear span of $\mathfrak{T}$ is $\varphi$-dense in $B_\infty(\Sigma)$, which completes the proof. ■

From this we see that if (6.1) holds for all measurable characteristic functions f,
then $(B_\infty(\Sigma), \varphi, \tau)$ is ergodic by Proposition 2.5.5(ii), hence $(X, \Sigma, \mu, T)$ is ergodic
as mentioned in Section 2.3.

Finally, with reference to Proposition 2.5.6(ii), we note that $\mathfrak{T}^* = \mathfrak{T}$ for $\mathfrak{T}$ as in
Proposition 2.6.2.

Next we briefly look at von Neumann algebras, as they are well-known examples
of unital *-algebras. Consider a von Neumann algebra $\mathfrak{M}$ and suppose $(\mathfrak{M}, \varphi, \tau)$
is a *-dynamical system. For example, $\tau$ might be a *-homomorphism leaving $\varphi$
invariant, that is to say, $\varphi(\tau(A)) = \varphi(A)$ for all $A \in \mathfrak{M}$. Then the results of Section
2.5 can be applied directly to $(\mathfrak{M}, \varphi, \tau)$. As a more explicit (and ergodic) example,
we note that $\mathfrak{A}$ in Example 2.5.7 is a von Neumann algebra on the Hilbert space $\mathbb{C}^2$.
We can also mention that $\tau$ in Example 2.5.7 is not a homomorphism (see Appendix
A.2).

We now describe one suitable choice for the $\varphi$-total sets appearing in Propositions
2.5.5 and 2.5.6. Let $\mathfrak{P}$ be the projections of $\mathfrak{M}$. It is known that $\mathfrak{M}$ is the
norm closure of the linear span of $\mathfrak{P}$, as is mentioned for example on p. 326 of
[KR1]. Since any state $\varphi$ on $\mathfrak{M}$ is continuous by virtue of being positive (see [BR,
Proposition 2.3.11]), it follows that $\mathfrak{P}$ is $\varphi$-total in $\mathfrak{M}$. Note also, regarding Proposition
2.5.6(ii), that $\mathfrak{P}^* = \mathfrak{P}$. This is all very similar to the measure theoretic case
in Proposition 2.6.2, since the measurable characteristic functions on X are exactly
the projections of $B_\infty(\Sigma)$. This similarity should not be too surprising, since the
theory of von Neumann algebras is often described as "noncommutative measure
theory" because of the close analogy with measure theory.
2.7 An alternative approach to recurrence

In this section (which is based on work contained in [D2]) we discuss an alternative
approach to recurrence which does not require $\varphi$ to be linear or complex-valued as
in Definition 2.3.1. The lack of linearity in this approach however precludes the
use of the GNS construction and Hilbert spaces, and because of this it does not
give any quantitative result as in Khintchine's Theorem and its noncommutative
generalization Theorem 2.5.1.

As we shall see, the theory is surprisingly close to the usual measure theoretic
setting. It therefore seems appropriate to briefly review a Poincare-like probabilistic
recurrence result. Consider a measure space $(X, \Sigma, \mu)$ with $\mu(X) < \infty$, and let $T :
X \to X$ be a mapping such that $\mu(T^{-1}(S)) = \mu(S)$ for all S in $\Sigma$. This is merely an
abstraction of Liouville's theorem. For some $S \in \Sigma$, suppose that $\mu(S \cap T^{-n}(S)) = 0$
for all $n \in \mathbb{N}$. For all $n, k \in \mathbb{N}$ we then have $\mu(T^{-k}(S) \cap T^{-(n+k)}(S)) = \mu(T^{-k}(S \cap
T^{-n}(S))) = \mu(S \cap T^{-n}(S)) = 0$. So $\mu(T^{-m}(S) \cap T^{-n}(S)) = 0$ for all $m, n \in \mathbb{N}$ with
$m \ne n$. It follows that

\[ \mu(X) \ge \mu\left( \bigcup_{k=1}^{n} T^{-k}(S) \right) = \sum_{k=1}^{n} \mu(T^{-k}(S)) = \sum_{k=1}^{n} \mu(S) = n\mu(S). \tag{7.1} \]

Note that the weaker condition $\mu(T^{-1}(S)) \le \mu(S)$ appearing in Khintchine's Theorem
2.1.1 would not be good enough to ensure this inequality. Letting $n \to \infty$ it
follows that $\mu(S) = 0$. This is a recurrence result, namely if $\mu(S) > 0$, then there
exists a positive integer n such that $\mu(S \cap T^{-n}(S)) > 0$. It tells us that S contains a
set $S \cap T^{-n}(S)$ of positive measure which is mapped back into S by $T^n$. From (7.1)
it is clear that the intuitive idea is simply that we cannot fit an infinite number of
sets the size of S into X without the sets overlapping, since X is of finite size (where
the size of a set is its measure). This is similar to the pigeonhole principle.
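The pigeonhole character of this argument is easy to see on a finite set. The following sketch is only an illustration under stated assumptions: X = {0, ..., N-1} carries the counting measure, T is a bijection given as a list (so it preserves the measure), and the function name `first_return_time` is hypothetical. By the counting in (7.1), the first return must happen within N / |S| steps.

```python
def first_return_time(T, S):
    """Smallest n >= 1 such that some x in S has T^n(x) in S, i.e. such that
    S & T^{-n}(S) is non-empty. T is a bijection of {0, ..., N-1} given as a
    list (T[x] is the image of x) and S is a non-empty set of points."""
    N = len(T)
    image = list(range(N))  # image[x] = T^n(x), starting with n = 0
    # The disjointness argument of (7.1) bounds the first return by N // len(S).
    for n in range(1, N // len(S) + 1):
        image = [T[y] for y in image]
        if any(image[x] in S for x in S):
            return n
    raise AssertionError("unreachable for a bijection and a non-empty S")


shift = [(x + 1) % 10 for x in range(10)]
print(first_return_time(shift, {0, 5}))
```

For the shift on ten points and S = {0, 5} the bound N / |S| = 5 is attained, which shows that the estimate coming from (7.1) cannot be improved in general.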

Note that the mapping $g \mapsto \tau(g) = g \circ T$ is a *-homomorphism of the *-algebra
$B_\infty(\Sigma)$ into itself such that $\varphi(\tau(g)) = \varphi(g)$ by Proposition 1.7.1, and
$\mu(S \cap T^{-n}(S)) = \varphi(\chi_S \tau^n(\chi_S))$ for $S \in \Sigma$, where $\varphi(g) = \int g \, d\mu$ for all $g \in B_\infty(\Sigma)$. Using
this notation the recurrence result above can be stated as follows: If $\varphi(\chi_S) > 0$,
then there exists a positive integer n such that $\varphi(\chi_S \tau^n(\chi_S)) > 0$. The general
*-algebraic approach will now be modelled after this situation. We also get some
inspiration from Postulate 1.2.1, for reasons which will become clear in Section 3.1.

For an element A of a *-algebra $\mathfrak{A}$, we write $A \ge 0$ if $A = R^*R$ for some $R \in \mathfrak{A}$.
If also $A \ne 0$, we write $A > 0$. By $A \le B$ we mean that $B - A \ge 0$.

2.7.1 Definition. Let $\mathfrak{A}$ be a *-algebra, and $\mathfrak{B}$ a unital *-algebra. Let $\varphi : \mathfrak{A} \to \mathfrak{B}$
be a positive mapping (i.e. $\varphi(A^*A) \ge 0$ for all $A \in \mathfrak{A}$). We call $\varphi$ additive if

\[ \sum_{k=1}^{n} \varphi(P_k) \le 1 \]

for any projections $P_1, \ldots, P_n \in \mathfrak{A}$ for which $\varphi(P_k P_l P_k) = 0$ if $k < l$. We call $\varphi$
faithful if it is linear, $\mathfrak{A}$ is unital, $\varphi(1) = 1$, and $\varphi(A^*A) > 0$ for all non-zero A
in $\mathfrak{A}$ (note that this requires that $A^*A \ne 0$ for $A \ne 0$, which is true for example in
any C*-algebra).

2.7.2 Proposition. If the positive mapping $\varphi$ given in Definition 2.7.1 is faithful,
then it is also additive.

Proof. Let $P_1, \ldots, P_n \in \mathfrak{A}$ be any projections for which $\varphi(P_k P_l P_k) = 0$ if $k < l$.
For $k < l$ we then have $\varphi((P_l P_k)^* P_l P_k) = 0$, so $P_l P_k = 0$, and therefore $P_k P_l =
(P_l P_k)^* = 0$. This implies that

\[ \sum_{k=1}^{n} P_k \le 1 \]

since the left-hand side is a projection in $\mathfrak{A}$. Thus

\[ \sum_{k=1}^{n} \varphi(P_k) = \varphi\left( \sum_{k=1}^{n} P_k \right) \le \varphi(1) = 1 \]

as promised. ■

2.7.3 Remark. In the measure theoretic setting described above, we can assume
without loss of generality that $\mu(X) = 1$. Then $\varphi : B_\infty(\Sigma) \to \mathbb{C}$ is a linear additive
mapping, since

\[ \sum_{k=1}^{n} \varphi(\chi_{S_k}) = \sum_{k=1}^{n} \mu(S_k) = \mu\left( \bigcup_{k=1}^{n} S_k \right) \le \mu(X) = 1 \]

for any $S_1, \ldots, S_n \in \Sigma$ such that $\varphi(\chi_{S_k} \chi_{S_l}) = \mu(S_k \cap S_l) = 0$ if $k \ne l$. However, $\varphi$
need not be faithful, since there can be a non-empty set S of measure zero (giving
$\varphi(\chi_S^* \chi_S) = 0$ even though $\chi_S \ne 0$), which is why we introduced the notion of
additivity. ■

We now state and prove a *-algebraic version of the recurrence result described
above:

2.7.4 Theorem. Consider a *-algebra $\mathfrak{A}$ and a unital C*-algebra $\mathfrak{B}$, and let $\varphi :
\mathfrak{A} \to \mathfrak{B}$ be an additive mapping. Let $\tau : \mathfrak{A} \to \mathfrak{A}$ be a *-homomorphism such that
$\varphi(\tau(PQP)) = \varphi(PQP)$ for all projections $P, Q \in \mathfrak{A}$. Then, for any projection $P \in
\mathfrak{A}$ such that $\varphi(P) > 0$, there exists a positive integer n such that $\varphi(P\tau^n(P)P) > 0$.

Proof. Note that $\varphi(P\tau^n(P)P) = \varphi((\tau^n(P)P)^* \tau^n(P)P) \ge 0$ for all $n \in \mathbb{N}$, since $\tau$ is
a *-homomorphism. We now imitate the measure theoretic proof.

Suppose $\varphi(P\tau^n(P)P) = 0$ for all $n \in \mathbb{N}$. For all $k, n \in \mathbb{N}$ we then have

\[ \varphi(\tau^k(P)\tau^{n+k}(P)\tau^k(P)) = \varphi(\tau^k(P\tau^n(P)P)) = \varphi(P\tau^n(P)P) = 0 \]

since $\tau$ is a homomorphism and P and therefore $\tau^n(P)$ are projections. Since $\varphi$ is
additive, it follows for any $n \in \mathbb{N}$ that

\[ \sum_{k=1}^{n} \varphi(\tau^k(P)) \le 1. \]

Furthermore,

\[ \sum_{k=1}^{n} \varphi(\tau^k(P)) = \sum_{k=1}^{n} \varphi(P) = n\varphi(P) \ge 0 \]

since $P = PPP$, $\varphi$ is positive and $P = P^*P$. Hence $0 \le n\varphi(P) \le 1$, and therefore
$n\|\varphi(P)\| \le 1$ since $\mathfrak{B}$ is a C*-algebra (see [Mu, Theorem 2.2.5(3)]). Letting $n \to \infty$,
it follows that $\varphi(P) = 0$. ■

It is clear that because of Remark 2.7.3, the measure theoretic recurrence result
described above is just a special case of Theorem 2.7.4, since the projections of the
*-algebra $B_\infty(\Sigma)$ are exactly the characteristic functions $\chi_S$, where $S \in \Sigma$.

Note that the trace $\mathrm{tr} : \mathfrak{M} \to \mathfrak{M} \cap \mathfrak{M}'$ of a finite von Neumann algebra $\mathfrak{M}$ is faithful
in the sense of Definition 2.7.1, hence we have the following corollary of Theorem
2.7.4 and Proposition 2.7.2, which will be used in Section 3.1:

2.7.5 Corollary. Consider a finite von Neumann algebra $\mathfrak{M}$ and let tr be its trace.
Let $\tau : \mathfrak{M} \to \mathfrak{M}$ be a *-homomorphism such that $\mathrm{tr}(\tau(A)) = \mathrm{tr}(A)$ for all A in $\mathfrak{M}$.
Then, for any projection $P \in \mathfrak{M}$ such that $\mathrm{tr}(P) > 0$, there exists a positive integer
n such that $\mathrm{tr}(P\tau^n(P)) > 0$.
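Corollary 2.7.5 can be tried out on the smallest noncommutative case. The sketch below assumes, purely for illustration, that the algebra is the 2 x 2 complex matrices with the normalized trace and that the *-homomorphism is conjugation by a fixed unitary U (which preserves the trace); the helper names are hypothetical. It searches for the first n with tr(P tau^n(P)) > 0.

```python
def matmul(a, b):
    """Product of two 2 x 2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def dagger(a):
    """Conjugate transpose of a 2 x 2 matrix."""
    return [[a[j][i].conjugate() for j in range(2)] for i in range(2)]


def tr(a):
    """Normalized trace, so that tr(identity) = 1."""
    return (a[0][0] + a[1][1]) / 2


def tau(a, u):
    """Conjugation a -> u a u*, a trace-preserving *-homomorphism (assumed model)."""
    return matmul(matmul(u, a), dagger(u))


def first_recurrence(p, u, n_max, tol=1e-12):
    """Smallest n in 1..n_max with tr(p tau^n(p)) > tol, or None if there is none."""
    current = p
    for n in range(1, n_max + 1):
        current = tau(current, u)
        if tr(matmul(p, current)).real > tol:
            return n
    return None


P = [[1, 0], [0, 0]]    # projection onto the first basis vector
U = [[0, -1], [1, 0]]   # rotation by a quarter turn
print(first_recurrence(P, U, 10))
```

After one step the projection is moved onto the orthogonal direction, so the overlap is zero; after two steps it returns to P and the overlap equals tr(P) = 1/2.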

We conclude this chapter with an open problem inspired by Theorem 2.7.4: 

2.7.6 Open Problem. Does Theorem 2.5.1 still hold if we only assume that $\varphi$ is
$\mathfrak{B}$-valued, instead of complex-valued, where $\mathfrak{B}$ is any unital C*-algebra? In fact, we
can ask if we can obtain the whole theory in Sections 2.3 and 2.5 if in Definition
2.3.1 we generalized the framework to $\varphi$ being $\mathfrak{B}$-valued instead of complex-valued.
A possible line of attack is to use Hilbert C*-modules (see [La]). ■
Chapter 3

Recurrence and ergodicity in mechanics

In this chapter we discuss recurrence and ergodicity in certain physical systems 
(quantum and classical). In Section 3.1 (which is based on [D2]) it is shown that 
recurrence takes place in a probabilistic sense in exactly the same way in bounded 
quantum systems as in classical systems with finite volume phase space. In Section 
3.2 we show under physically reasonable assumptions that quantum and classical 
systems are not ergodic in the sense of Definition 2.3.2 (or, equivalently, in terms 
of the characterization in Proposition 2.5.5), if the state of the system allows more 
than one energy level to be obtained in a measurement (i.e. if more than one energy 
level has a nonzero probability). 

3.1 Recurrence 

Consider a bounded quantum system $(\mathfrak{M}, \mathfrak{H}, H)$ and assume that $\mathfrak{M}$ is a factor. Let
$\tau$ be the system's time-evolution, as in Proposition 1.7.5. Fix any $t > 0$. Since the
trace tr of $\mathfrak{M}$ is faithful, Corollary 2.7.5 and Proposition 1.7.5 tell us that for any
nonzero projection $P \in \mathfrak{M}$ there exists an $n(t) \in \mathbb{N}$ such that

\[ \mathrm{tr}(P\tau_{n(t)t}(P)) > 0. \tag{1.1} \]

Note that $\mathrm{tr}(P\tau_{n(t)t}(P)) = \mathrm{tr}(P\tau_{n(t)t}(P)P)$, which has the form of $\omega'$ in Postulate
1.2.1, i.e. the state after a "yes" was obtained in a yes/no experiment with projection
P when the initial state was tr. Also remember that according to Postulate 1.8.3,
tr is the state of no information.

So, to interpret (1.1), consider the case where we have no information about the
state of our bounded quantum system. By Postulate 1.8.3 the state is then given
by tr. At time 0 we perform a yes/no experiment with projection $P \in \mathfrak{M}$ on the






system. Assuming the result is "yes", the state of the system after the experiment
is given by the state $\omega$ on $\mathfrak{M}$ defined by

\[ \omega(A) = \mathrm{tr}(PA)/\mathrm{tr}(P), \]

according to Postulate 1.2.1. (Also recall from Section 1.2 that the probability of
getting "yes" is tr(P), therefore $\mathrm{tr}(P) > 0$ in this case.) By (1.1) we then have

\[ p(t) := \omega(\tau_{n(t)t}(P)) > 0. \tag{1.2} \]

This simply tells us that if we were to repeat the above mentioned yes/no experiment
exactly at the moment $n(t)t$, when its projection is given by $\tau_{n(t)t}(P)$ according to
Section 1.4 (iv), then there is a nonzero probability $p(t)$ that we will again get "yes".
By replacing t by $t' = n(t)t + 1$, we see that there is in fact an unbounded set of
moments $n(t)t < n(t')t' < \ldots$ for which (1.2) holds.

So we have obtained a quantum mechanical version of recurrence. Note that 
the measure theoretic recurrence result described in Section 2.7 will give exactly the 
same result as (1.2), with the same physical interpretation, when applied to a clas- 
sical mechanical system whose phase space (see Remark 1.7.2) has finite Lebesgue 
measure; just replace $\omega$, tr, $\tau$ and P by their classical analogues described in Sections
1.3 and 1.8. In particular, tr is replaced by integration with respect to normalized 
Lebesgue measure, which then represents the state of no information. So we see that 
(probabilistic) recurrence in quantum mechanics and in classical mechanics follow 
from the same general result, namely Theorem 2.7.4, since Corollary 2.7.5 and the 
measure theoretic recurrence result are both special cases of this theorem. 

A drawback of (1.2) is that it gives no indication as to how large $\omega(\tau_{n(t)t}(P))$ is,
or how often it is positive. Theorem 2.5.1 on the other hand, tells us that for any
$\varepsilon > 0$ there is in fact a relatively dense set M in $\mathbb{N}$ such that

\[ \omega(\tau_{mt}(P)) > \mathrm{tr}(P) - \varepsilon \tag{1.3} \]

for all $m \in M$, which is a quantitative improvement over (1.2), since it says that
$\omega(\tau_{mt}(P))$ is regularly (i.e. almost periodically) larger than $\mathrm{tr}(P) - \varepsilon$. Since tr(P) was
the probability of getting a "yes" during the first execution of the yes/no experiment,
we see from (1.3) that at the moments mt the probability of getting "yes" when doing
the experiment a second time is larger than or at least arbitrarily close to the original
probability of getting "yes". Similar results concerning wave functions and density
operators are presented in [HH] and [Perc]. If as before we replace $\omega$, tr, $\tau$ and
P by their classical counterparts, and then apply Theorem 2.5.1 again, we find the
same result as (1.3) for classical mechanics, with exactly the same interpretation as
in quantum mechanics.
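To get a feeling for (1.3), one can look at a two-level toy model. Everything in the sketch below is an assumption made for illustration and is not taken from the thesis: the Hamiltonian is diag(0, E), P is the rank-one projection onto the normalized sum of the two basis vectors, and tr is the normalized trace on 2 x 2 matrices, so tr(P) = 1/2. A short calculation under these assumptions gives the repeat probability tr(P tau_s(P))/tr(P) = cos^2(E s / 2). The function names are hypothetical.

```python
import math


def repeat_probability(energy_gap, s):
    """Probability of a second "yes" at time s in the assumed two-level model:
    cos^2(energy_gap * s / 2)."""
    return math.cos(energy_gap * s / 2) ** 2


def recurrence_set(energy_gap, t, eps, m_max):
    """All m in 1..m_max with repeat probability at time m*t above tr(P) - eps."""
    tr_p = 0.5  # normalized trace of a rank-one projection on C^2
    return [m for m in range(1, m_max + 1)
            if repeat_probability(energy_gap, m * t) > tr_p - eps]


M = recurrence_set(1.0, 1.0, 0.1, 200)
largest_gap = max(b - a for a, b in zip(M, M[1:]))
print(M[:8], largest_gap)
```

In this model the gaps between consecutive elements of M stay bounded, which is the relative density asserted for the set M in (1.3).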

There is, however, a small technical problem: The probability of repeating the 
yes/no experiment exactly at the moment n{t)t is zero. The same goes for any of 
the moments mt above. The next simple proposition remedies the situation in the
quantum case:

3.1.1 Proposition. Let $\tau$ be as in Proposition 1.7.5, where we take $\mathfrak{M}$ to be a
finite factor. Then for any projection P in $\mathfrak{M}$, the mapping

\[ \mathbb{R} \to \mathbb{R} : t \mapsto \mathrm{tr}(P\tau_t(P)) \]

is continuous, where tr is the trace of $\mathfrak{M}$.

Proof. By Stone's Theorem $U_t$ in Proposition 1.7.5 is strongly continuous (i.e.,
$t \mapsto U_t x$ is continuous for every $x \in \mathfrak{H}$), so clearly the mapping $t \mapsto \tau_t(A)$ is weakly
continuous for every $A \in \mathfrak{M}$ (i.e., $t \mapsto \langle x, \tau_t(A)y \rangle = \langle U_t x, A U_t y \rangle$ is continuous
for any $x, y \in \mathfrak{H}$). Hence $t \mapsto P\tau_t(P)$ is weakly continuous. We know that tr is
ultraweakly continuous (see [KR2, Theorem 8.2.8], for example), and therefore it
is weakly continuous on the unit ball of $\mathfrak{M}$ by [KR2, Proposition 7.4.5]. Since
$\|P\tau_t(P)\| \le 1$, we conclude that $t \mapsto \mathrm{tr}(P\tau_t(P))$ is continuous. ■

So from (1.3) we see that for every $m \in M$ there exists a $\delta_m > 0$ such that

\[ \omega(\tau_s(P)) > \mathrm{tr}(P) - \varepsilon \quad \text{for } mt - \delta_m < s < mt + \delta_m. \]

This tells us that quantum mechanical recurrence is possible in practice, assuming
we are working with a bounded quantum system as above, since there is a nonzero
probability of repeating the yes/no experiment during one of the time-intervals
$(mt - \delta_m, mt + \delta_m)$. It should be mentioned though, that the elements of M might
be very far apart, so we might have to wait very long after the initial yes/no
experiment before the probability $\mathrm{tr}(P) - \varepsilon$ is reached as in (1.3).

According to Conjecture 1.9.1, a quantum mechanical system bounded in space, 
and isolated from outside influences, can be mathematically described as a bounded 
quantum system. So this is the physical situation for which we could expect
recurrence as above. This guess is confirmed by [BL] and [Perc]. In classical mechanics
we indeed have recurrence for systems with finite volume phase space, in particular
for a system with bounded phase space in $\mathbb{R}^{2n}$, which corresponds to a system
bounded in space and isolated from outside influences (see Section 1.9). This fact
constitutes some additional circumstantial evidence for Conjecture 1.9.1. 

3.2 Ergodicity 

In Section 3.1 we saw how recurrence comes about in mechanics in terms of the state 
of no information (tr in the quantum case; integration with respect to normalized 
Lebesgue measure in the classical case). What is important here, is that when we 
applied Theorem 2.5.1 (and Theorem 2.7.4) to mechanics, we took $\varphi$ to be the state
of no information.

Say we also want to apply Theorem 2.5.2 to mechanics to find the following
result: We consider two yes/no experiments with projections P and Q at time zero,
for a given system. The P experiment is performed when we have no information
regarding the system's state (i.e. we start with the state of no information $\varphi$), and
a "yes" is obtained, changing the state to $\omega$ defined by $\omega(A) = \varphi(PA)/\varphi(P)$. We
want to know if a subsequent execution of the Q experiment (at one of the points in
time from the set E in Theorem 2.5.2) will give "yes" with probability $\varphi(Q) - \varepsilon$ or
larger, where E depends on $\varepsilon > 0$. This is a simple extension of the recurrence result
we found in Section 3.1 (see in particular equation (1.3)). However, for Theorem
2.5.2 to be applicable, we need the system to be an ergodic *-dynamical system. In
this section we show that under physically reasonable assumptions, we do not have
ergodicity. (However, to prove that this implies that for any fixed $t > 0$ there is a
pair P and Q as above with $\varphi(P) > 0$ and $\varphi(Q) > 0$, such that the probability for
a "yes" in the Q experiment is zero at all discrete times $kt$, $k \in \mathbb{N}$, we would first
have to solve Open Problem 2.5.8.)

3.2.1 Definition. Consider a quantum or classical mechanical system $(\mathfrak{A}, \varphi, \tau_t)$
where $\mathfrak{A}$ is the observable algebra of the system, $\varphi$ is the state of no information (we
assume that it exists) and $\tau_t$ is the time-evolution. We call the system bounded if
it is either a bounded quantum system $(\mathfrak{M}, \mathrm{tr}, \tau_t)$ where $\mathfrak{M}$ is a finite factor with tr
its trace and $\tau_t$ defined as in Proposition 1.7.5, or a classical system $(B_\infty(F), \varphi, \tau_t)$
whose phase space $F \subset \mathbb{R}^{2n}$ (see Remark 1.7.2) has finite Lebesgue measure, where
$\varphi(g) = \left( \int_F g \, d\lambda \right) / \lambda(F)$ with $\lambda$ the Lebesgue measure on $\mathbb{R}^{2n}$, and $\tau_t$ is given by
equation (3.3) in Section 1.3.

Note that because of Liouville's Theorem (equation (7.1) in Section 1.7) and its
quantum analogue, Proposition 1.7.5, a bounded mechanical system $(\mathfrak{A}, \varphi, \tau_t)$ is a
*-dynamical system as defined in Definition 2.3.1, for any fixed t. Our goal in this
section is therefore to show that under physically reasonable assumptions, such a
system is not ergodic. Actually we will prove the more general result that if the
state of a system allows more than one energy level (in the sense of Definition 3.2.3),
then we do not have ergodicity.

We will work in the following general setting: 

3.2.2 General Setting. Let $\mathfrak{A}$ be the observable algebra of a physical system
(quantum or classical), and H the system's Hamiltonian (remember that the
Hamiltonian of a system gives the system's energy). $\mathfrak{A}$ is a unital *-algebra. In the classical
case we assume $\mathfrak{A}$ to be an algebra of bounded complex-valued measurable functions
on some measurable space F with $g^* = \bar{g}$ the involution, and we assume H to be a
(possibly unbounded) measurable function $F \to \mathbb{R}$. In the quantum case we assume
$\mathfrak{A}$ to be an algebra of bounded linear operators $\mathfrak{H} \to \mathfrak{H}$ on some Hilbert space $\mathfrak{H}$
with the involution being the Hilbert adjoint, and we assume H to be a (possibly
unbounded) self-adjoint linear operator in $\mathfrak{H}$. Keep in mind that in the quantum
case we allow the Hamiltonian to be represented in a Hilbert space which might
not be the state space, as is the case in Definition 1.7.3 and Remark 1.7.4. That
is to say, $\mathfrak{H}$ is not necessarily the state space of the quantum system. For reasons
of generality, we likewise do not assume that F is the phase space of the classical
system.

Furthermore, we assume that $\chi_V(H) \in \mathfrak{A}$ for all Borel $V \subset \mathbb{R}$, where $\chi_V(H)$ is
given by the Borel functional calculus (in the classical case $\chi_V(H) := \chi_V \circ H$ as in
Remark 1.4.2), and that $\chi_V(H)$ is the projection of the yes/no experiment "Is the
energy in V?" (Note that if we were to take $\mathfrak{A} = B_\infty(F)$ for a classical system, or
$\mathfrak{A} = \mathfrak{L}(\mathfrak{H})$ for a quantum system, then $\mathfrak{A}$ would contain all these projections in any
case.)

As always, we assume the time-evolution to be a one-parameter *-automorphism
group $\tau$ of $\mathfrak{A}$ as in Section 1.4 (iv). In the quantum case it is given by

\[ \tau_t(A) = e^{iHt} A e^{-iHt} \]

and in the classical case by

\[ \tau_t(A) = A \circ T_t \]

where $T_t$ is an energy conserving (i.e. $H \circ T_t = H$) flow depending on H. (If the
time-evolution does not conserve energy, then it means that the system is interacting
with other systems. We could consider these systems as part of our system to ensure
conservation of energy. The time-evolution for a quantum system as given above
automatically conserves energy, since we take H to be fixed, so it does not allow
interactions with other systems; see the proof of Theorem 3.2.7.)
We then call $(\mathfrak{A}, H)$ a mechanical system.

Where reference is made to an observable of the system, it will be assumed to
have the same mathematical form as H above. ■

We will assume that a bounded mechanical system is nontrivial in the sense that 
it has more than one distinguishable energy level. We have to state more clearly 
what we mean by this however. A simple way to do this in our framework is as 
follows: 

3.2.3 Definition. Consider a state $\omega$ of a mechanical system $(\mathfrak{A}, H)$ in the general
setting above. (So $\omega$ is a state on $\mathfrak{A}$.) We say that $\omega$ allows more than one
energy level if there are two open intervals $J_1$ and $J_2$ in $\mathbb{R}$ such that $\bar{J}_1 \cap \bar{J}_2 = \emptyset$,
$\omega(\chi_{J_1}(H)) > 0$ and $\omega(\chi_{J_2}(H)) > 0$, and a bounded interval J in $\mathbb{R}$ such that
$\omega(\chi_J(H)) > 0$. A bounded mechanical system $(\mathfrak{A}, \varphi, \tau_t)$ with Hamiltonian H is
called nontrivial if $\varphi$ allows more than one energy level. ■

3.2.4 Remark. Definition 3.2.3 says that if we have the state $\omega$ for the system, and
we measure the energy, then there is a nonzero probability of getting a value in $J_1$,
and a nonzero probability of getting a value in $J_2$. In this sense then, more than one
energy level of the system can be distinguished, since $J_1$ and $J_2$ are separated (i.e.
$\bar{J}_1 \cap \bar{J}_2 = \emptyset$). The existence of the bounded interval J implies that the system has at
least one finite energy level (this is a sensible assumption and not at all restrictive, 
since in practice one can generally assume that a physical system does not possess an 
infinite amount of energy; note that when modelling a physical system, some useful 
models might have an infinite amount of energy, for example in the thermodynamic 
limit [Rue], but in this thesis we consider the system, rather than a model which 
deviates from the system in such a nonphysical way). 

If the state of no information of a bounded mechanical system does not allow 
more than one energy level (in the technical sense given in Definition 3.2.3), then 
it effectively means that the system only has one energy level (i.e. it is physically 
trivial), since in the state of no information all energy levels should be equally 
likely. ■ 

3.2.5 Lemma. For Borel sets $U, V \subset \mathbb{R}$ with $U \subset V$ we have

\[ \chi_U(A) \le \chi_V(A) \]

where A is an observable of a mechanical system as in General Setting 3.2.2.

Proof. In the classical case this is easy, namely

\[ \chi_U(A) = \chi_{A^{-1}(U)} \le \chi_{A^{-1}(V)} = \chi_V(A) \]

since $A^{-1}(U) \subset A^{-1}(V)$. Alternatively (as harbinger to the quantum case below),
one can note that

\[ \chi_U(A)\chi_V(A) = (\chi_U \chi_V)(A) = \chi_U(A) \]

since $U \subset V$, hence $\chi_U(A) \le \chi_V(A)$.

In the quantum case it follows from the properties of the Borel functional calculus
[SZ, 9.11(v), 9.13(iii) and 9.32] and the fact that a bounded linear operator on a
Hilbert space is closed, that

\[ \chi_U(A)\chi_V(A) = (\chi_U \chi_V)(A) = \chi_U(A) \]

and hence $\chi_U(A) \le \chi_V(A)$, since $\chi_U(A)$ and $\chi_V(A)$ are projections (see [Mu,
Theorem 2.3.2] for properties of projections). ■
3.2.6 Proposition. Consider a state $\omega$ of a mechanical system $(\mathfrak{A}, H)$ which allows
more than one energy level in the sense of Definition 3.2.3. Then there exists a
bounded interval I in $\mathbb{R}$ such that $0 < \omega(\chi_I(H)) < 1$.

Proof. Write $p(V) := \omega(\chi_V(H))$ for all Borel sets $V \subset \mathbb{R}$. ($p(V)$ is the probability
for a "yes" in the yes/no experiment "Is the system's energy in V?") Suppose that

\[ p(I) \in \{0, 1\} \tag{2.1} \]

for all bounded intervals I in $\mathbb{R}$. By assumption there exists a bounded interval
$I_0$ in $\mathbb{R}$ such that $p(I_0) > 0$, and hence $p(I_0) = 1$. Because of Lemma 3.2.5, we
can assume without loss that this interval is of the form $I_0 = [a_0, b_0)$. We now
inductively construct a sequence $I_0, I_1, I_2, \ldots$ of intervals such that $p(I_n) = 1$ for all
n:

Divide $I_n$ in its left and right halves (each of the form $[c, d)$), and let $I_{n+1} =
[a_{n+1}, b_{n+1})$ be the half such that $p(I_{n+1}) = 1$.

Note that $I_{n+1}$ exists by induction, since if it did not, we would have $p(L) =
p(R) = 0$ by (2.1), where L and R are the left and right halves of $I_n$, and then by
the properties of the Borel functional calculus (and arguments as in the proof of
Lemma 3.2.5)

\[ 0 = p(L) + p(R) = \omega(\chi_L(H) + \chi_R(H)) = \omega((\chi_L + \chi_R)(H)) = p(I_n) \tag{2.2} \]

which contradicts $p(I_n) = 1$. The sequences $(a_n)$ and $(b_n)$ are bounded, and increasing
and decreasing respectively, while $b_n - a_n = (b_0 - a_0)/2^n$. This implies that they
converge to the same value, say E.

We can view E as the only energy level of the system that can be obtained
in a measurement, since any open set V containing E contains an $I_n$, and hence
$1 = p(I_n) \le p(V) \le 1$ by Lemma 3.2.5, so the probability for a "yes" in the yes/no
experiment "Is the energy in V?" is one. The idea is therefore to get a contradiction
with Definition 3.2.3, which says that there are at least two energy levels. So consider
any open intervals $J_1$ and $J_2$ in $\mathbb{R}$ with $\bar{J}_1 \cap \bar{J}_2 = \emptyset$.

Case 1. Say $E \in J_1$. Then $p(J_1) = 1$ as for $p(V)$ above. It follows that $p(J_2) = 0$,
otherwise we would have

\[ p(J_1 \cup J_2) = p(J_1) + p(J_2) > 1 \]

similar to (2.2), which contradicts the definition of p. (Similarly if we had $E \in J_2$.)

Case 2. Now suppose $E \notin J_1 \cup J_2$. Since $\bar{J}_1 \cap \bar{J}_2 = \emptyset$, we can assume without loss
that $E \notin \bar{J}_1$, which implies that an n exists such that $I_n \subset \mathbb{R} \setminus \bar{J}_1$ as for V above.
So by Lemma 3.2.5 we then have $p(\mathbb{R} \setminus \bar{J}_1) = 1$ and also $p(\mathbb{R}) = 1$, and therefore (in
the same way as (2.2)),

\[ p(\bar{J}_1) = p(\mathbb{R}) - p(\mathbb{R} \setminus \bar{J}_1) = 0. \]

So, again by Lemma 3.2.5, $0 \le p(J_1) \le p(\bar{J}_1) = 0$.

From these two cases we see that we either have $p(J_1) = 0$ or $p(J_2) = 0$,
contradicting the assumptions. Therefore (2.1) must be wrong, which means that
$0 < p(I) < 1$ for some bounded interval I. ■
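The bisection in this proof can be run constructively when the energy distribution is known. The sketch below is a hypothetical illustration, not the thesis's argument itself: it assumes a discrete distribution given as a dictionary mapping energy values to probabilities, starts from an interval [a, b) of total probability 1, and halves it until a half with probability strictly between 0 and 1 appears. The names `prob` and `splitting_interval` are made up for this example.

```python
from fractions import Fraction


def prob(levels, a, b):
    """p([a, b)) for a discrete energy distribution {energy: probability}."""
    return sum(w for e, w in levels.items() if a <= e < b)


def splitting_interval(levels, a, b, max_steps=60):
    """Return a bounded interval (lo, hi), meaning [lo, hi), whose probability is
    strictly between 0 and 1, or None if none is found within max_steps halvings."""
    if 0 < prob(levels, a, b) < 1:
        return (a, b)
    for _ in range(max_steps):
        mid = (a + b) / 2
        for lo, hi in ((a, mid), (mid, b)):
            if 0 < prob(levels, lo, hi) < 1:
                return (lo, hi)
        # Both halves have probability 0 or 1; continue in the half carrying
        # probability 1, as in the construction of the intervals I_n.
        a, b = (a, mid) if prob(levels, a, mid) == 1 else (mid, b)
    return None


two_levels = {1.0: Fraction(3, 10), 2.5: Fraction(7, 10)}
one_level = {1.0: Fraction(1)}
print(splitting_interval(two_levels, 0, 4), splitting_interval(one_level, 0, 4))
```

With two energy levels the first halving already separates them; with a single level the halving never finds such an interval, which corresponds to the intervals I_n shrinking to the single energy E in the proof.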

3.2.7 Theorem. Consider a state $\omega$ of a mechanical system $(\mathfrak{A}, H)$ which allows
more than one energy level in the sense of Definition 3.2.3, and let $\tau$ be the
time-evolution of the system as in General Setting 3.2.2. Fix any $t \in \mathbb{R}$, and assume that
$(\mathfrak{A}, \omega, \tau_t)$ is a *-dynamical system (i.e. $\omega(\tau_t(A^*A)) \le \omega(A^*A)$ for all $A \in \mathfrak{A}$). Then
$(\mathfrak{A}, \omega, \tau_t)$ is not ergodic (in the sense of Definition 2.3.2). In particular, a nontrivial
bounded mechanical system (as in Definitions 3.2.1 and 3.2.3) is not ergodic.

Proof. By Proposition 3.2.6 there is a Borel set $V \subset \mathbb{R}$ such that $0 < \omega(P) < 1$ for
$P := \chi_V(H)$.

By conservation of energy in the classical case, we have $H \circ T_t = H$, hence
$\tau_t(P) = \chi_V \circ H \circ T_t = \chi_V \circ H = P$. In the quantum mechanical case we have
$\tau_t(P) = e^{iHt}\chi_V(H)e^{-iHt} = \left( e^{i(\cdot)t} \chi_V e^{-i(\cdot)t} \right)(H) = \chi_V(H) = P$ by the properties of
the Borel functional calculus [SZ, 9.11(v)], which says that energy is conserved. So,
in either case

\[ \tau_t(P) = P. \tag{2.3} \]

Consider any $a_1, a_2 \in \mathbb{C}$ and set $A := a_1 P + a_2(1 - P)$. Now set

\[ B_n := \frac{1}{n} \sum_{k=0}^{n-1} \tau_t^k(A) \quad \text{and} \quad C_n := B_n - \omega(A) \]

then $B_n = A$ by (2.3) since $\tau_t(1) = 1$. Write $p := \omega(P)$, then it follows that

\[ C_n = a_1 P + a_2(1 - P) - a_1 p - a_2(1 - p) = (a_1 - a_2)(P - p) \]

and therefore

\[ \|C_n\|_\omega = \sqrt{\omega(C_n^* C_n)} = |a_1 - a_2| \sqrt{p(1 - p)} \]

so

\[ \lim_{n \to \infty} \|C_n\|_\omega = |a_1 - a_2| \sqrt{p(1 - p)} \ne 0 \]

if we choose $a_1 \ne a_2$, since $0 < p < 1$. Therefore the system is not ergodic, by
Proposition 2.5.5(i). ■

The system in Example 2.5.7 is ergodic despite the fact that tr is the state of no
information, simply because the "time-evolution" $\tau$ behaves differently from that of
a physical system as in Theorem 3.2.7. In the ergodic case, $\tau$ in Example 2.5.7 only
has fixed points of the form

\[ \begin{pmatrix} a & 0 \\ 0 & a \end{pmatrix} \]

which is only a projection if $a \in \{0, 1\}$, hence a projection P as in (2.3) with $0 <
\mathrm{tr}(P) < 1$ does not exist. One can say that $\tau$ does not preserve the various "energy
levels" of the system, but only preserves the system as a whole.
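The norm computed in the proof of Theorem 3.2.7 can be checked numerically in the simplest commuting model. The sketch below rests on assumptions that are not in the thesis: P is represented as the diagonal matrix diag(1, 0), the state is given by the weights (p, 1 - p) on the diagonal, and the time-evolution fixes every diagonal element, so each term of the time average equals A. The function name `ergodic_defect` is hypothetical.

```python
import math


def ergodic_defect(p, a1, a2, n):
    """Norm of C_n = B_n - omega(A) in the state with weights (p, 1 - p), for
    A = a1*P + a2*(1 - P) with P = diag(1, 0) fixed by the time-evolution."""
    # Time average B_n: every term tau_t^k(A) equals A = diag(a1, a2).
    b_n = [sum(a for _ in range(n)) / n for a in (a1, a2)]
    mean = p * a1 + (1 - p) * a2                  # omega(A)
    c_n = [b - mean for b in b_n]                 # diagonal entries of C_n
    return math.sqrt(p * abs(c_n[0]) ** 2 + (1 - p) * abs(c_n[1]) ** 2)


expected = abs(2 - (-1)) * math.sqrt(0.3 * 0.7)
print(ergodic_defect(0.3, 2, -1, 50), expected)
```

The value does not depend on n and matches |a1 - a2| sqrt(p(1 - p)), so the time average never approaches the state average unless a1 = a2 or p is 0 or 1.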

3.2.8 Remarks. Essentially Theorem 3.2.7 says that if the state is a mixture of 
more than one energy state (so more than one value of energy has nonzero probability 
when the observer measures the energy) , then the state is not ergodic (in this context 
it makes more sense to speak of an ergodic state, rather than an ergodic system, 
since the state describes the observer's information about the physical system as in 
Section 1.6, rather than being a property of the system itself). From the statistical 
point of view that we have been using since Chapter 1, this should be the typical 
situation in practice, since normally an observer would not be able to measure the 
energy precisely enough to give a state allowing only one energy level. So if the 
observer does not have complete (or precise) information about the system's energy, 
then the state describing his information isn't ergodic. 

Intuitively Theorem 3.2.7 makes perfect sense. If more than one energy level 
is present in the state, then we can imagine decomposing it into its various energy 
"components" (for example, decompose the phase space into its constant energy 
surfaces in the case of a classical system; see below). By the conservation of energy, 
the time-evolution does not mix the various energy components with each other. 
But this clearly violates the basic intuition behind ergodicity, namely that in an 
ergodic system, any "part" is eventually mixed with every other part (see Corollary 
2.6.1 and the discussion following it, as well as Theorem 2.5.2 and Proposition 2.5.6, 
which all say that any part of an ergodic system eventually overlaps with every other 
part) . So it is also clear why conservation of energy plays a central role in the proof 
of Theorem 3.2.7. 

This result does not mean that the idea of ergodicity is in principle irrelevant in 
physics. Theoretically one can still consider states allowing only one energy level, 
and study whether they are ergodic or not. For example, a state given by any 
probability measure on a constant energy surface (given by H = E, where E is the 
energy of the surface) of a classical system, by definition allows only one energy level 
E, while each energy eigenstate of a quantum system (assuming the Hamiltonian has 
eigenvectors) by definition corresponds to a single energy level (also see Appendix
B). Ergodicity would then be a property of the system, rather than of the observer's 
information, which in the light of Theorem 3.2.7 seems like the sensible approach to 
ergodicity in physics. 

In classical mechanics ergodicity arises in the sense that one would consider
systems where for almost every pure state (point) $x$ in a constant energy surface,
the time average

\[ \frac{1}{n}\sum_{k=0}^{n-1} f \circ T_{kt}(x) \]

of any observable $f$ converges to the average $\omega(f)$ of the observable over the constant
energy surface, for any fixed $t > 0$, where the state $\omega$ of the system is given by a
time-invariant probability measure on the constant energy surface (the existence of
such a measure follows from Liouville's Theorem; see for example [Kh, Section 7]
or [Pete, Chapter 1, Proposition 2.2], and also [Rue, Section 1.1]). Since only
one energy level is involved, this is not in conflict with Theorem 3.2.7. We can
mention that in 1962-63 Sinai succeeded in proving that a classical gas, consisting of
hard spheres enclosed in a box and interacting through pair potentials, is ergodic in
this sense (refer to [AA, Section 18] or [Rue, Section 1.1] and references therein).
Ergodicity as given by Definition 2.3.2, or equivalently by equation (6.1) in Section
2.6, with $\varphi = \omega$ and the probability measure on the constant energy surface, is a
slightly weaker form of ergodicity. Refer to [Rud, Theorem 3.12] for the connection
of this with the almost everywhere convergence mentioned above, namely that it
implies the existence of a subsequence of the time-averages

\[ \frac{1}{n}\sum_{k=0}^{n-1} f \circ T_{kt} \]

converging pointwise almost everywhere to $\omega(f)$, whereas for the case above the
whole sequence converges pointwise almost everywhere to $\omega(f)$.

In quantum mechanics the idea is to study states that are ergodic in some sense,
the simplest approach being to take eigenstates of the Hamiltonian (if they exist)
as ergodic, since for such an eigenstate $x$ we have $e^{-iHt}x = e^{-iEt}x$ where $E$ is the
corresponding eigenvalue (the energy), and hence for any fixed $t$

\[ \frac{1}{n}\sum_{k=0}^{n-1} \omega(\tau_{kt}(A)) = \frac{1}{n}\sum_{k=0}^{n-1} \langle x, \tau_{kt}(A)x \rangle = \frac{1}{n}\sum_{k=0}^{n-1} \langle e^{-iEkt}x, Ae^{-iEkt}x \rangle = \langle x, Ax \rangle =: \omega(A) \]

which is an equality of a time average and a "state average". (Also see [T, Remark
(3.1.23;1)].) This is a very primitive form of ergodicity of a state. For a deeper
approach, refer to [T, Sections 3.1 and 3.2], and in particular [T, Remarks (3.2.10;6)
and (3.2.16;1)] for the relation between ergodicity and KMS states (equilibrium
states). Also see Appendix B for a more precise description of the ergodicity of the
Hamiltonian's eigenstates.

The unfortunate situation in quantum mechanics (as far as ergodicity goes) is
that even if the system is in a state containing only one energy level, measuring an
observable not commuting with the Hamiltonian will typically leave the system in a
state which does contain more than one energy level, in which case it can no longer
be ergodic. But as mentioned earlier, this still doesn't stop us from studying those
states which are ergodic.

For more on quantum ergodicity, see [NTW], [ENTS] and [Z]. ■






Appendix A 

Examples concerning ergodicity 



A.1 On the definition of ergodicity

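This section is devoted to the construction of a *-dynamical system $(\mathfrak{A}, \varphi, \tau)$ with
the property that if $\|\tau(A) - A\|_\varphi = 0$, then $\|A - \alpha\|_\varphi = 0$ for some $\alpha \in \mathbb{C}$, but
for which the fixed points of the operator $U$ defined in Proposition 2.3.3 in terms
of some cyclic representation, form a vector subspace of $\mathfrak{H}$ with dimension greater
than one. This will prove the necessity of a sequence, rather than a single element,
in Definition 2.3.2, in order for Proposition 2.3.3 to hold.

First some general considerations. Consider a dense vector subspace $\mathfrak{G}$ of a
Hilbert space $\mathfrak{H}$, and let $\mathfrak{L}(\mathfrak{H})$ be the bounded linear operators $\mathfrak{H} \to \mathfrak{H}$. Set

\[ \mathfrak{A} := \{ A|_{\mathfrak{G}} : A \in \mathfrak{L}(\mathfrak{H}),\ A\mathfrak{G} \subset \mathfrak{G} \text{ and } A^*\mathfrak{G} \subset \mathfrak{G} \} \]

where $A|_{\mathfrak{G}}$ denotes the restriction of $A$ to $\mathfrak{G}$. Then $\mathfrak{A}$ is clearly a vector subspace of
$\mathfrak{L}(\mathfrak{G})$. For any $A \in \mathfrak{A}$, denote by $\overline{A}$ the (unique) bounded linear extension of $A$ to
$\mathfrak{H}$. Now define an involution on $\mathfrak{A}$ by

\[ A^* := \overline{A}^*|_{\mathfrak{G}} \]

for all $A \in \mathfrak{A}$. It is then easily verified that $\mathfrak{A}$ becomes a unital *-algebra. (For
$A, B \in \mathfrak{A}$ it is clear that $AB$ is a bounded linear operator $\mathfrak{G} \to \mathfrak{G}$ which therefore
has the extension $\overline{A}\,\overline{B} \in \mathfrak{L}(\mathfrak{H})$ for which $\overline{A}\,\overline{B}\mathfrak{G} \subset \mathfrak{G}$ and $(\overline{A}\,\overline{B})^*\mathfrak{G} = \overline{B}^*\overline{A}^*\mathfrak{G} \subset \mathfrak{G}$
by the definition of $\mathfrak{A}$. Hence $AB \in \mathfrak{A}$, which means that $\mathfrak{A}$ is a subalgebra of $\mathfrak{L}(\mathfrak{G})$.
Also,

\[ (AB)^* = (\overline{A}\,\overline{B})^*|_{\mathfrak{G}} = (\overline{B}^*\overline{A}^*)|_{\mathfrak{G}} = \overline{B}^*(\overline{A}^*|_{\mathfrak{G}}) = \overline{B}^*A^* = B^*A^*. \]

Similarly for the other defining properties of an involution.) Note that for $A \in \mathfrak{A}$ and $x, y \in \mathfrak{G}$
we have

\[ \langle x, Ay \rangle = \langle x, \overline{A}y \rangle = \langle \overline{A}^*x, y \rangle = \langle A^*x, y \rangle . \]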



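For a given norm one vector $\Omega \in \mathfrak{G}$ we define a state $\varphi$ on $\mathfrak{A}$ by

\[ \varphi(A) = \langle \Omega, A\Omega \rangle . \]

Next we construct a cyclic representation of $(\mathfrak{A}, \varphi)$. Let

\[ \pi : \mathfrak{A} \to L(\mathfrak{G}) : A \mapsto A \]

then clearly $\pi$ is linear with $\pi(1) = 1$ and $\pi(AB) = \pi(A)\pi(B)$. Note that for any
$x, y \in \mathfrak{G}$ we have $(x \otimes y)^* = y \otimes x$, hence $(x \otimes y)\mathfrak{G} \subset \mathfrak{G}$ and $(x \otimes y)^*\mathfrak{G} \subset \mathfrak{G}$, so
$(x \otimes y)|_{\mathfrak{G}} \in \mathfrak{A}$. Now, $\pi((x \otimes \Omega)|_{\mathfrak{G}})\Omega = x\langle \Omega, \Omega \rangle = x$, hence $\pi(\mathfrak{A})\Omega = \mathfrak{G}$. Furthermore,
$\langle \pi(A)\Omega, \pi(B)\Omega \rangle = \langle A\Omega, B\Omega \rangle = \langle \Omega, A^*B\Omega \rangle = \varphi(A^*B)$. Thus $(\mathfrak{G}, \pi, \Omega)$ is a cyclic
representation of $(\mathfrak{A}, \varphi)$.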

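Suppose we have a unitary operator $U : \mathfrak{H} \to \mathfrak{H}$ such that $U\mathfrak{G} = \mathfrak{G}$ and $U\Omega = \Omega$.
Then $U^*\mathfrak{G} = U^{-1}\mathfrak{G} = \mathfrak{G}$, so $V := U|_{\mathfrak{G}} \in \mathfrak{A}$, and $V^* = U^*|_{\mathfrak{G}}$. It follows that
$VAV^* \in \mathfrak{A}$ for all $A \in \mathfrak{A}$, hence we can define a linear function $\tau : \mathfrak{A} \to \mathfrak{A}$ by

\[ \tau(A) = VAV^*. \]

Clearly $V^*V = 1 = VV^*$, so $\tau(1) = 1$ and $\varphi(\tau(A)^*\tau(A)) = \varphi(VA^*AV^*) =
\langle V^*\Omega, A^*AV^*\Omega \rangle = \varphi(A^*A)$, since $V^*\Omega = U^{-1}\Omega = \Omega$. Therefore $(\mathfrak{A}, \varphi, \tau)$ is a *-
dynamical system. Note that $U|_{\mathfrak{G}}$ satisfies equation (3.1) of Section 2.3, namely
$U\pi(A)\Omega = UA\Omega = UAU^*\Omega = \tau(A)\Omega = \pi(\tau(A))\Omega$, hence $U$ is the operator which
appears in Proposition 2.3.3.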

Assume $\{x \in \mathfrak{G} : Ux = x\} = \mathbb{C}\Omega$. If $\|\tau(A) - A\|_\varphi = 0$, it then follows for $x =
\iota(A)$, with $\iota$ given by equation (2.1) of Section 2.2, that $\|Ux - x\| = \|\iota(\tau(A) - A)\| =
\|\tau(A) - A\|_\varphi = 0$, so $x = \alpha\Omega$ for some $\alpha \in \mathbb{C}$. Therefore $\|A - \alpha\|_\varphi = \|\iota(A - \alpha)\| =
\|x - \alpha\Omega\| = 0$.

In other words, assuming that the fixed points of $U$ in $\mathfrak{G}$ form the one-dimensional
subspace $\mathbb{C}\Omega$, it follows that $\|\tau(A) - A\|_\varphi = 0$ implies that $\|A - \alpha\|_\varphi = 0$ for some
$\alpha \in \mathbb{C}$.

It remains to construct an example of a $U$ with all the properties mentioned
above, whose fixed point space in $\mathfrak{H}$ has dimension greater than one. The following
example was constructed by L. Zsido:

Let $\mathfrak{H}$ be a separable Hilbert space with an orthonormal basis of the form

\[ \{\Omega, y\} \cup \{u_k : k \in \mathbb{Z}\} \]

(that is to say, this is a total orthonormal set in $\mathfrak{H}$) and define the linear operator
$U : \mathfrak{H} \to \mathfrak{H}$ via bounded linear extension by

\[ U\Omega = \Omega, \qquad Uy = y, \qquad Uu_k = u_{k+1}. \]
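Clearly $U$ is isometric, while $U\mathfrak{H}$ is dense in $\mathfrak{H}$, hence $U$ is surjective, since $\mathfrak{H}$ is
complete. Since $U$ is a surjective isometry, it is unitary. Let $\mathfrak{G}$ be the linear span of

\[ \{\Omega\} \cup \{y + u_k : k \in \mathbb{Z}\}. \]

Then $U\mathfrak{G} = \mathfrak{G}$. Furthermore, $\mathfrak{G}$ is dense in $\mathfrak{H}$. Indeed,

\[ \left\| y - \frac{1}{n}\sum_{k=1}^{n}(y + u_k) \right\| = \frac{1}{n}\left\| \sum_{k=1}^{n} u_k \right\| = \frac{1}{\sqrt{n}} \to 0 \]

implies that $y \in \overline{\mathfrak{G}}$, the closure of $\mathfrak{G}$, hence also

\[ u_k = (y + u_k) - y \in \overline{\mathfrak{G}} \]

for $k \in \mathbb{Z}$.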
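
The density estimate can be checked numerically in a finite truncation. The following is only an illustrative sketch: the function name `distance_to_average` and the use of coordinates with respect to the orthonormal set $\{y, u_1, \ldots, u_n\}$ are assumptions made for this illustration and do not appear in the text.

```python
import math


def distance_to_average(n):
    """Norm of y - (1/n) * sum_{k=1}^n (y + u_k), computed in the
    coordinates (y, u_1, ..., u_n) of an orthonormal set."""
    # Coordinates of y itself.
    y = [1.0] + [0.0] * n
    # The average of the n vectors y + u_k has coefficient 1 on y
    # and coefficient 1/n on each u_k.
    average = [1.0] + [1.0 / n] * n
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y, average)))


if __name__ == "__main__":
    for n in (1, 4, 100, 10000):
        # The two printed numbers should agree: the distance is 1/sqrt(n).
        print(n, distance_to_average(n), 1 / math.sqrt(n))
```

The distance decreases like $1/\sqrt{n}$, so $y$ is a limit of elements of the linear span, although it is not itself in the span.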

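Next we show that

\[ \{x \in \mathfrak{G} : Ux = x\} = \mathbb{C}\Omega. \qquad (1.1) \]

If $\alpha\Omega + \sum_{k=-n}^{n} \beta_k (y + u_k) \in \mathfrak{G}$ is left fixed by $U$, then

\[ \alpha\Omega + \sum_{k=-n}^{n} \beta_k y + \sum_{k=-n}^{n} \beta_k u_k = \alpha\Omega + \sum_{k=-n}^{n} \beta_k y + \sum_{k=-n}^{n} \beta_k u_{k+1} \]

and it follows that $\beta_{-n} = 0$, and that $\beta_{k+1} = \beta_k$ for $k = -n, \ldots, n - 1$. Thus

\[ \alpha\Omega + \sum_{k=-n}^{n} \beta_k (y + u_k) = \alpha\Omega \]

proving (1.1).

On the other hand,

\[ \{x \in \mathfrak{H} : Ux = x\} \]

clearly contains the two-dimensional vector space spanned by $\Omega$ and $y$.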



A.2 An example of an ergodic system

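Here we give the proof that Example 2.5.7 is indeed ergodic. It is clear that $\tau$ is
linear and that $\tau(1) = 1$. Let

\[ A = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix} \quad \text{and} \quad B = \begin{pmatrix} b_{11} & b_{12} \\ b_{21} & b_{22} \end{pmatrix} \]

be complex matrices. Then

\[ \tau(A)^* = \begin{pmatrix} \overline{a_{22}} & \overline{c_2 a_{21}} \\ \overline{c_1 a_{12}} & \overline{a_{11}} \end{pmatrix} \]

and

\[ \tau(A)^*\tau(A) = \begin{pmatrix} |a_{22}|^2 + |c_2 a_{21}|^2 & \overline{a_{22}}\, c_1 a_{12} + \overline{c_2 a_{21}}\, a_{11} \\ \overline{c_1 a_{12}}\, a_{22} + \overline{a_{11}}\, c_2 a_{21} & |c_1 a_{12}|^2 + |a_{11}|^2 \end{pmatrix} \]

while

\[ A^* = \begin{pmatrix} \overline{a_{11}} & \overline{a_{21}} \\ \overline{a_{12}} & \overline{a_{22}} \end{pmatrix} \]

and

\[ A^*A = \begin{pmatrix} |a_{11}|^2 + |a_{21}|^2 & \overline{a_{11}}\, a_{12} + \overline{a_{21}}\, a_{22} \\ \overline{a_{12}}\, a_{11} + \overline{a_{22}}\, a_{21} & |a_{12}|^2 + |a_{22}|^2 \end{pmatrix} \]

so

\[ \varphi(\tau(A)^*\tau(A)) = \frac{1}{2}\left( |a_{22}|^2 + |c_2 a_{21}|^2 + |c_1 a_{12}|^2 + |a_{11}|^2 \right) \leq \frac{1}{2}\left( |a_{22}|^2 + |a_{21}|^2 + |a_{12}|^2 + |a_{11}|^2 \right) = \varphi(A^*A) \]

for all $A$, meaning that $(\mathfrak{A}, \varphi, \tau)$ is a *-dynamical system, if and only if $|c_1| \leq 1$ and
$|c_2| \leq 1$, which is what we will assume.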
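
As a numerical sanity check of this inequality, one can evaluate both sides for a concrete matrix. The sketch below assumes, as read off from the matrix for $\tau(A)^*$ above, that $\tau$ swaps the diagonal entries and multiplies $a_{12}$ and $a_{21}$ by $c_1$ and $c_2$ respectively, and that $\varphi$ is half the trace; the helper names and the sample matrix are hypothetical.

```python
def tau(m, c1, c2):
    """Assumed form of tau: swap the diagonal, scale the off-diagonal entries."""
    (a11, a12), (a21, a22) = m
    return ((a22, c1 * a12), (c2 * a21, a11))


def adjoint(m):
    """Conjugate transpose of a 2x2 matrix stored as nested tuples."""
    return ((m[0][0].conjugate(), m[1][0].conjugate()),
            (m[0][1].conjugate(), m[1][1].conjugate()))


def matmul(a, b):
    """Product of two 2x2 matrices."""
    return tuple(tuple(sum(a[i][k] * b[k][j] for k in range(2))
                       for j in range(2)) for i in range(2))


def phi(m):
    """Normalized trace, the state assumed in this sketch."""
    return (m[0][0] + m[1][1]) / 2


def sq_seminorm(m):
    """phi(m* m), which is real and nonnegative."""
    return phi(matmul(adjoint(m), m)).real


# Hypothetical sample matrix, chosen only for illustration.
A = ((1, 2j), (3, -1))

if __name__ == "__main__":
    print(sq_seminorm(A))                  # phi(A* A)
    print(sq_seminorm(tau(A, 0.5, 1j)))    # |c1|, |c2| <= 1: not larger
    print(sq_seminorm(tau(A, 2, 1)))       # |c1| > 1: larger than phi(A* A)
```

With $|c_1| > 1$ the inequality fails for this matrix, in line with the "only if" part of the statement.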

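Next we prove that it is ergodic. For even $k \geq 0$ we have

\[ \tau^k(B) = \begin{pmatrix} b_{11} & c_1^k b_{12} \\ c_2^k b_{21} & b_{22} \end{pmatrix} \]

and therefore

\[ A\tau^k(B) = \begin{pmatrix} a_{11}b_{11} + a_{12}c_2^k b_{21} & a_{11}c_1^k b_{12} + a_{12}b_{22} \\ a_{21}b_{11} + a_{22}c_2^k b_{21} & a_{21}c_1^k b_{12} + a_{22}b_{22} \end{pmatrix} \]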




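which means

\[ \varphi(A\tau^k(B)) = \frac{1}{2}\left( a_{11}b_{11} + a_{12}c_2^k b_{21} + a_{21}c_1^k b_{12} + a_{22}b_{22} \right). \]

For odd $k > 0$ we then get

\[ \varphi(A\tau^k(B)) = \frac{1}{2}\left( a_{11}b_{22} + a_{12}c_2^k b_{21} + a_{21}c_1^k b_{12} + a_{22}b_{11} \right) \]

by switching $b_{11}$ and $b_{22}$. For $c \in \mathbb{C}$ it is clear that $U : \mathbb{C} \to \mathbb{C} : x \mapsto cx$ is a linear
operator with $\|U\| \leq 1$ if and only if $|c| \leq 1$, and for $c \neq 1$ the only fixed point of $U$
is 0, in which case

\[ \frac{1}{n}\sum_{k=0}^{n-1} c^k x = \frac{1}{n}\sum_{k=0}^{n-1} U^k x \to 0 \]

for all $x \in \mathbb{C}$ as $n \to \infty$, by the Mean Ergodic Theorem 2.4.1. Hence, for $c_1 \neq 1$ and
$c_2 \neq 1$ it follows that

\[ \lim_{n \to \infty} \frac{1}{n}\sum_{k=0}^{n-1} \varphi(A\tau^k(B)) = \lim_{n \to \infty} \frac{1}{n} \begin{cases} \frac{n}{2}\left[ \frac{1}{2}(a_{11}b_{11} + a_{22}b_{22}) + \frac{1}{2}(a_{11}b_{22} + a_{22}b_{11}) \right] & \text{for } n \text{ even} \\ \frac{n-1}{2}\left[ \frac{1}{2}(a_{11}b_{11} + a_{22}b_{22}) + \frac{1}{2}(a_{11}b_{22} + a_{22}b_{11}) \right] + \frac{1}{2}(a_{11}b_{11} + a_{22}b_{22}) & \text{for } n \text{ odd} \end{cases} \]

\[ = \frac{a_{11} + a_{22}}{2} \cdot \frac{b_{11} + b_{22}}{2} = \varphi(A)\varphi(B) \]

which means that $(\mathfrak{A}, \varphi, \tau)$ is ergodic, by Proposition 2.5.6(ii).

On the other hand, if $c_1 = 1$ and $c_2 \neq 1$, then we have by a similar calculation
that

\[ \lim_{n \to \infty} \frac{1}{n}\sum_{k=0}^{n-1} \varphi(A\tau^k(B)) = \varphi(A)\varphi(B) + \frac{a_{21}b_{12}}{2}. \]

Likewise for the other cases where either $c_1$ or $c_2$ or both are equal to 1. So $(\mathfrak{A}, \varphi, \tau)$
is ergodic if and only if $c_1 \neq 1$ and $c_2 \neq 1$.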
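
The two limits can be illustrated numerically by computing the Cesàro means directly. This is only a sketch under stated assumptions: $\tau$ is taken to swap the diagonal entries and scale the off-diagonal entries $a_{12}$, $a_{21}$ by $c_1$, $c_2$, the state $\varphi$ is taken to be half the trace, and the matrices `A`, `B` and all function names are hypothetical choices for the illustration.

```python
def tau(m, c1, c2):
    """Assumed form of tau: swap the diagonal, scale the off-diagonal entries."""
    (a11, a12), (a21, a22) = m
    return ((a22, c1 * a12), (c2 * a21, a11))


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(tuple(sum(a[i][k] * b[k][j] for k in range(2))
                       for j in range(2)) for i in range(2))


def phi(m):
    """Normalized trace, the state assumed in this sketch."""
    return (m[0][0] + m[1][1]) / 2


def cesaro_mean(a, b, c1, c2, n):
    """(1/n) * sum_{k=0}^{n-1} phi(a * tau^k(b))."""
    total = 0
    t = b  # t holds tau^k(b), starting with k = 0
    for _ in range(n):
        total += phi(matmul(a, t))
        t = tau(t, c1, c2)
    return total / n


# Hypothetical sample matrices: phi(A) = 3, phi(B) = 0.5, a21 * b12 = 3j.
A = ((1, 2), (3, 5))
B = ((2, 1j), (4, -1))

if __name__ == "__main__":
    n = 20000
    print(phi(A) * phi(B))                   # the ergodic limit phi(A) phi(B)
    print(cesaro_mean(A, B, 0.5j, -1, n))    # c1, c2 != 1: close to phi(A) phi(B)
    print(cesaro_mean(A, B, 1, -1, n))       # c1 = 1: shifted by a21 * b12 / 2
```

For $c_1 = 1$ the mean settles at $\varphi(A)\varphi(B) + a_{21}b_{12}/2$ instead, which differs from $\varphi(A)\varphi(B)$ whenever $a_{21}b_{12} \neq 0$.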



A.2.1 Remark. It is easily seen that $\tau$ is not a homomorphism, namely

\[ \tau(AB) = \begin{pmatrix} a_{21}b_{12} + a_{22}b_{22} & c_1(a_{11}b_{12} + a_{12}b_{22}) \\ c_2(a_{21}b_{11} + a_{22}b_{21}) & a_{11}b_{11} + a_{12}b_{21} \end{pmatrix} \]

while

\[ \tau(A)\tau(B) = \begin{pmatrix} a_{22}b_{22} + c_1c_2 a_{12}b_{21} & c_1(a_{22}b_{12} + a_{12}b_{11}) \\ c_2(a_{21}b_{22} + a_{11}b_{21}) & c_1c_2 a_{21}b_{12} + a_{11}b_{11} \end{pmatrix}. \]

In fact, unless $c_1c_2 = 1$, it follows that we don't even have $\tau(A^2) = \tau(A)^2$ for all
$A$. Nor, for that matter, do we have $\tau(A^*) = \tau(A)^*$ for all $A$, unless $c_2 = \overline{c_1}$. This
is opposed to the situation for a measure theoretic dynamical system as defined in
Section 2.1, where $\tau$ in equation (1.1) of that section is always a *-homomorphism.
It therefore makes sense not to assume that $\tau$ is a *-homomorphism in Definition
2.3.1, since we now have an example where it isn't. ■
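
Both failures can be seen on very small test matrices. The sketch below again assumes that $\tau$ swaps the diagonal entries and scales $a_{12}$, $a_{21}$ by $c_1$, $c_2$; the matrices `S` and `N` and the function names are hypothetical and chosen only to make the two conditions $c_1c_2 = 1$ and $c_2 = \overline{c_1}$ visible.

```python
def tau(m, c1, c2):
    """Assumed form of tau: swap the diagonal, scale the off-diagonal entries."""
    (a11, a12), (a21, a22) = m
    return ((a22, c1 * a12), (c2 * a21, a11))


def adjoint(m):
    """Conjugate transpose of a 2x2 matrix stored as nested tuples."""
    return ((m[0][0].conjugate(), m[1][0].conjugate()),
            (m[0][1].conjugate(), m[1][1].conjugate()))


def matmul(a, b):
    """Product of two 2x2 matrices."""
    return tuple(tuple(sum(a[i][k] * b[k][j] for k in range(2))
                       for j in range(2)) for i in range(2))


def close(a, b, tol=1e-12):
    """Entrywise comparison of two 2x2 matrices."""
    return all(abs(a[i][j] - b[i][j]) < tol for i in range(2) for j in range(2))


S = ((0, 1), (1, 0))  # S * S is the identity, while tau(S)^2 = c1 * c2 * identity
N = ((0, 1), (0, 0))  # tau(N*) has entry c2 where tau(N)* has conj(c1)


def square_preserved(c1, c2):
    """Does tau(S^2) equal tau(S)^2 for the test matrix S?"""
    ts = tau(S, c1, c2)
    return close(tau(matmul(S, S), c1, c2), matmul(ts, ts))


def adjoint_preserved(c1, c2):
    """Does tau(N*) equal tau(N)* for the test matrix N?"""
    return close(tau(adjoint(N), c1, c2), adjoint(tau(N, c1, c2)))


if __name__ == "__main__":
    print(square_preserved(0.5, 1j), adjoint_preserved(0.5, 1j))
    print(square_preserved(1j, -1j), adjoint_preserved(1j, -1j))
```

For the pair $c_1 = i$, $c_2 = -i$ both checks pass on these particular test matrices, since then $c_1c_2 = 1$ and $c_2 = \overline{c_1}$; this does not make $\tau$ a homomorphism in general.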

A.2.2 Remark. We note that $\varphi(\tau(A)) = \varphi(A)$, i.e. $\varphi$ is $\tau$-invariant, but this
fact in itself does not imply that $\varphi(\tau(A)^*\tau(A)) \leq \varphi(A^*A)$, since $\tau$ is not a *-
homomorphism, by Remark A.2.1.

Furthermore, $\varphi(AB) = \varphi(BA)$ for all $A, B \in \mathfrak{A}$, so $\varphi$ is commutative (so to
speak) even though $\mathfrak{A}$ is not. Also, while $\tau(AB) \neq \tau(BA)$ for some $A, B \in \mathfrak{A}$, we
still have $\varphi(\tau(AB)) = \varphi(AB) = \varphi(BA) = \varphi(\tau(BA))$, so $\tau$ is noncommutative (so
to speak), but with respect to $\varphi$ it is again commutative. We conclude that while $\mathfrak{A}$
is noncommutative, $(\mathfrak{A}, \varphi, \tau)$ is still in many respects commutative simply because
$\varphi(AB) = \varphi(BA)$ for all $A$ and $B$. ■

A.2.3 Question. Is there an example of an ergodic *-dynamical system $(\mathfrak{A}, \varphi, \tau)$
in which $\varphi(AB) \neq \varphi(BA)$ for some $A, B \in \mathfrak{A}$? Yes, see the example in Appendix
B. ■



Appendix B 

Ergodicity of energy eigenstates 



In this appendix, added to the thesis in December 2003¹, we briefly study the
ergodicity of energy eigenstates, and in the process exhibit another example of an ergodic
system.

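Consider any Hilbert space $\mathfrak{H}$, and let $\tau : \mathfrak{L}(\mathfrak{H}) \to \mathfrak{L}(\mathfrak{H})$ be given by

\[ \tau(A) := e^{iH}Ae^{-iH} \]

where $H$ is a (possibly unbounded) self-adjoint linear operator in $\mathfrak{H}$. Consider the
state $\omega$ on $\mathfrak{L}(\mathfrak{H})$ given by

\[ \omega(A) = \langle \Omega, A\Omega \rangle \qquad (1) \]

for some unit vector $\Omega \in \mathfrak{H}$. Then $(\mathfrak{H}, \mathrm{id}_{\mathfrak{L}(\mathfrak{H})}, \Omega)$ is a cyclic representation of $(\mathfrak{L}(\mathfrak{H}), \omega)$,
since $(x \otimes \Omega)\Omega = x$, so $\mathfrak{L}(\mathfrak{H})\Omega = \mathfrak{H}$. Now set

\[ \iota : \mathfrak{L}(\mathfrak{H}) \to \mathfrak{H} : A \mapsto A\Omega \]

as in Section 2.2. If we assume that

\[ e^{-iH}\Omega = e^{-iE}\Omega \qquad (2) \]

for some $E \in \mathbb{R}$, then $\omega \circ \tau = \omega$, making $(\mathfrak{L}(\mathfrak{H}), \omega, \tau)$ a *-dynamical system, and
ensuring that

\[ U : \mathfrak{H} \to \mathfrak{H} : \iota(A) \mapsto \iota(\tau(A)) \]

is well-defined just as in Section 2.3 and simplifies to $UA\Omega = e^{-iE}e^{iH}A\Omega$. For any
$x \in \mathfrak{H}$ we now have

\[ Ux = U(x \otimes \Omega)\Omega = e^{-iE}e^{iH}(x \otimes \Omega)\Omega = e^{-iE}e^{iH}x \]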

¹ but prepared as part of a lecture at the SA Mathematical Society's 45th annual conference in
Stellenbosch in November 2002



so

\[ U = e^{-iE}e^{iH}. \]

We have to look at the dimension of the fixed point space of $U$, since this is how
we decide whether the system is ergodic or not, by 2.3.3. In terms of the projection
$P$ onto this space, we have

\[ P\mathfrak{H} = \{x \in \mathfrak{H} : e^{iH}x = e^{iE}x\}. \]

Since $e^{iH}e^{-iH} = 1$, we see from (2) that $\Omega \in P\mathfrak{H}$, as we know it must by the proof
of 2.3.3. Suppose that $E$ is degenerate in the sense that there exists an $x \in \mathfrak{H} \setminus (\mathbb{C}\Omega)$
such that $e^{-iH}x = e^{-iE}x$, then exactly as above, $x \in P\mathfrak{H}$, so $\dim(P\mathfrak{H}) > 1$, which
means that $(\mathfrak{L}(\mathfrak{H}), \omega, \tau)$ is not ergodic by 2.3.3. On the other hand, if such an $x$
does not exist, then $\dim(P\mathfrak{H}) = 1$, and hence $(\mathfrak{L}(\mathfrak{H}), \omega, \tau)$ is ergodic by 2.3.3.

Remarks. So for example, suppose that $E_1$ and $E_2$ are different eigenvalues of a
Hamiltonian $H$ such that $E_2 = E_1 + 2\pi n$ for some $n \in \mathbb{Z}$. Let $\Omega$ be a unit eigenvector
of $H$ corresponding to $E_1$, and let $x$ be an eigenvector of $H$ corresponding to $E_2$,
so $x \notin \mathbb{C}\Omega$. Then $e^{-iH}\Omega = e^{-iE_1}\Omega$ and $e^{-iH}x = e^{-iE_2}x = e^{-iE_1}x$, which means that
$(\mathfrak{L}(\mathfrak{H}), \omega, \tau)$ is not ergodic as explained above.

Suppose however that there exists an orthonormal basis $b_1, b_2, b_3, \ldots$ for $\mathfrak{H}$,
consisting of eigenvectors of $H$, with corresponding eigenvalues $E_1, E_2, E_3, \ldots$ where
$E_k \notin \{E_1 + 2\pi n : n \in \mathbb{Z}\}$ for $k > 1$, which means in particular that $E_1$ is a
nondegenerate eigenvalue (though some of the other $E_k$'s might be equal to each other and
hence degenerate). Let $\Omega = b_1$. Note that $\mathfrak{H}$ could be finite or infinite dimensional.
Then the system is ergodic: Consider any $x \in \mathfrak{H}$. By assumption $x = \sum_k \alpha_k b_k$ for
some $\alpha_k \in \mathbb{C}$. The condition $e^{-iH}x = e^{-iE_1}x$, or equivalently $e^{iH}x = e^{iE_1}x$, then
implies that

\[ \sum_k \alpha_k e^{iE_k} b_k = \sum_k \alpha_k e^{iE_1} b_k \]

which means that $\alpha_k = 0$ or $e^{iE_k} = e^{iE_1}$. For $k > 1$ the latter contradicts the fact
that $E_k \notin \{E_1 + 2\pi n : n \in \mathbb{Z}\}$, hence $\alpha_k = 0$. But this means that $x = \alpha_1\Omega \in \mathbb{C}\Omega$,
so $P\mathfrak{H} = \mathbb{C}\Omega$ and hence $\dim(P\mathfrak{H}) = 1$. Note that instead of saying the system is
ergodic, we could also say that the energy eigenvector $\Omega$ is ergodic.
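
For a Hamiltonian that is diagonal in a finite orthonormal basis, the criterion above reduces to counting the basis vectors $b_k$ with $e^{iE_k} = e^{iE_1}$, since $U = e^{-iE_1}e^{iH}$ is then diagonal with entries $e^{i(E_k - E_1)}$. The following sketch does exactly this counting; the function names, the finite list of energies and the numerical tolerance are assumptions made for the illustration.

```python
import cmath
import math


def fixed_point_dimension(energies, index=0, tol=1e-9):
    """Number of basis vectors b_k with exp(i E_k) = exp(i E_index).

    For a diagonal H this is the dimension of the fixed point space of
    U = exp(-i E_index) exp(i H); the comparison uses a tolerance tol.
    """
    reference = cmath.exp(1j * energies[index])
    return sum(1 for e in energies if abs(cmath.exp(1j * e) - reference) < tol)


def is_ergodic(energies, index=0, tol=1e-9):
    """The eigenstate b_index is ergodic when the fixed point space is one dimensional."""
    return fixed_point_dimension(energies, index, tol) == 1


if __name__ == "__main__":
    # E_1 = 0 is nondegenerate and no other level differs from it by 2*pi*n.
    print(is_ergodic([0.0, 1.0, 1.0]))
    # The level 2*pi differs from E_1 = 0 by a multiple of 2*pi.
    print(fixed_point_dimension([0.0, 2 * math.pi, 1.0]))
    # E_1 itself is degenerate.
    print(is_ergodic([1.0, 1.0]))
```

Note that degeneracy among the other levels (the repeated value 1.0 in the first call) does not affect the ergodicity of the first eigenvector.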

This is complementary to 3.2.7 and 3.2.8 where we saw that for a system to
be ergodic, no more than one energy level is allowed to be present in the state
(the intuition being that energy conservation would prohibit "mixing" if more than
one energy level was present). Now we see that, assuming the existence of energy
eigenvectors, there are even more restrictions for such an eigenvector to be an ergodic
state, namely the energy $E$ of the state should have a one dimensional eigenspace,
and shouldn't differ from any other energy level by an integer multiple of $2\pi$. In
practice it is however quite possible that in a system where "theoretically" these
conditions aren't met, the presence of interactions and slight variations from place
to place in the system might separate a higher dimensional energy eigenspace into
lower dimensional eigenspaces by splitting the single energy level of the eigenspace
into slightly differing energy levels, some of which then could have one dimensional
eigenspaces, which would ensure ergodicity of these energy eigenvectors if any energy
differences of exactly $2\pi n$ that they might have had with other eigenvectors are also
destroyed by the splitting of the energy levels. ■



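Example of an ergodic system. Consider a spin-1/2 particle at a fixed position
in a magnetic field. Its state space is $\mathfrak{H} = \mathbb{C}^2$ and its Hamiltonian

\[ H = \begin{pmatrix} E & 0 \\ 0 & -E \end{pmatrix} \]

where $E \in \mathbb{R}$. If we assume

\[ E - (-E) \notin 2\pi\mathbb{Z} \qquad (3) \]

then this system $(\mathfrak{L}(\mathfrak{H}), \omega, \tau)$ with the state $\omega$ given by (1) in terms of the energy
eigenvector

\[ \Omega = \begin{pmatrix} 1 \\ 0 \end{pmatrix} \]

is ergodic as explained above. This can also be checked directly by considering

\[ A = \begin{pmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{pmatrix} \]

in which case $\omega(A) = a_{11}$ and

\[ \tau^k(A) = \begin{pmatrix} a_{11} & a_{12}e^{2iEk} \\ a_{21}e^{-2iEk} & a_{22} \end{pmatrix}, \qquad \frac{1}{n}\sum_{k=0}^{n-1} \tau^k(A) = \begin{pmatrix} a_{11} & a_{12}\frac{1}{n}\sum_{k=0}^{n-1} e^{2iEk} \\ a_{21}\frac{1}{n}\sum_{k=0}^{n-1} e^{-2iEk} & a_{22} \end{pmatrix} \]

so

\[ \left\| \frac{1}{n}\sum_{k=0}^{n-1} \tau^k(A) - \omega(A) \right\|_\omega^2 = \left\| \begin{pmatrix} 0 & a_{12}\frac{1}{n}\sum_{k=0}^{n-1} e^{2iEk} \\ a_{21}\frac{1}{n}\sum_{k=0}^{n-1} e^{-2iEk} & a_{22} - a_{11} \end{pmatrix} \right\|_\omega^2 = |a_{21}|^2 \left| \frac{1}{n}\sum_{k=0}^{n-1} e^{-2iEk} \right|^2 \to 0 \]

as $n \to \infty$, by the mean ergodic theorem 2.4.1, since $e^{2iE} \neq 1$ by (3), and $|e^{2iE}| = 1$.
So in this direct way using 2.5.5, we again see that $(\mathfrak{L}(\mathfrak{H}), \omega, \tau)$ is ergodic. Note
that in this example there are $A$ and $B$ such that $\omega(AB) \neq \omega(BA)$, as asked in
A.2.3. ■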
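
The decay of the squared seminorm, and a pair $A$, $B$ with $\omega(AB) \neq \omega(BA)$, can be illustrated numerically. This is a sketch only: it assumes $\Omega = (1, 0)$, so that $\omega$ picks out the upper left entry, and evaluates the closed expression $|a_{21}|^2 |\frac{1}{n}\sum_{k=0}^{n-1} e^{-2iEk}|^2$; the function names, the sample values of $E$ and the matrices `A`, `B` are hypothetical.

```python
import cmath
import math


def sq_deviation(a21, energy, n):
    """|a21|^2 * |(1/n) * sum_{k=0}^{n-1} exp(-2 i E k)|^2."""
    mean_phase = sum(cmath.exp(-2j * energy * k) for k in range(n)) / n
    return abs(a21) ** 2 * abs(mean_phase) ** 2


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(tuple(sum(a[i][k] * b[k][j] for k in range(2))
                       for j in range(2)) for i in range(2))


def omega(m):
    """The state <Omega, m Omega> for Omega = (1, 0): the upper left entry."""
    return m[0][0]


# Hypothetical matrices on which omega is not tracial.
A = ((0, 1), (0, 0))
B = ((0, 0), (1, 0))

if __name__ == "__main__":
    for n in (10, 100, 1000):
        # E = 1 satisfies assumption (3), so the deviation tends to 0.
        print(n, sq_deviation(3, 1.0, n))
    # E = pi violates (3): every phase equals 1 and the deviation stays at |a21|^2.
    print(sq_deviation(3, math.pi, 1000))
    print(omega(matmul(A, B)), omega(matmul(B, A)))
```

When $2E$ is a multiple of $2\pi$ the averaged phase does not decay, which corresponds to the non-ergodic case excluded by (3).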